9 



(19) 



(12) 




Europaisches Patentamt 
European Patent Ofti.ee 
Office europeen o'es brevets 




(43) Date of publication: 

08.03.2000 Bulletin 2000/10 



(n) EP 0 984 616 A2 

EUROPEAN PATENT APPLICATION 

(51) IntCI.': H04N 1/32 



8 

UJ 
— J 

5 

I 

I- 

UJ 
CD 



(21) Application number: 99306946.7 

(22) Date of filing: 01.09.1999 



(84) 


Designated Contracting States: 


(72) 


Inventors: 




AT BE CH CY DE DK ES Fl FR GB GR IE IT LI LU 


• 


Ogawa, Hiroshi 




MC NL PT SE 




ku, Shinjuku-ku, Tokyo 163-1419 (JP) 




Designated Extension States: 


• 


Tomioka, Atsuki 




AL LT LV MK RO SI 




ku, Shinjuku-ku, Tokyo 163-1419 (JP) 






• 


Nakamura, Takao 


(30) 


Priority: 04.09.1998 JP 25119398 




ku, Shinjuku-ku, Tokyo 163-1419 (JP) 


08.02.1999 JP 3054099 




Takashima, Youichi 




18.02.1999 JP 4071399 




ku, Shinjuku-ku, Tokyo 163-1419 (JP) 




26.04.1999 JP 11807299 










(74) 


Representative: Rees, Alexander Ellison et al 


(71) 


Applicant: Nippon Telegraph and Telephone 




Urquhart-Dykes & Lord 


Corporation 




91 Wimpole Street 




Tokyo 100-8116 (JP) 




London W1M 8AH (GB) 



(54) Method and apparatus for digital watermarking 



(57) A method for embedding digital watermark data 
in digital data contents includes the steps of obtaining a 
frequency coefficient of block data of digital data con- 
tents, obtaining a complexity of the block data, obtaining 
an amount of transformation of the frequency coefficient 
from the complexity and the digital watermark data, and 
embedding the digital watermark data by transforming 
the frequency coefficient. In addition, a method for read- 
ing digital watermark data includes the steps of calcu- 



lating a probability of reading T or '0' in a read bit se- 
quence by using a test method on the basis of binary 
distribution, determining the presence or absence of 
digital watermark data according to the probability, and 
reconstituting digital watermark data. Another method 
includes the steps of performing soft decision in code 
theory by assigning weights to the digital watermark se- 
quence with a weighting function, and reconstituting dig- 
ital watermark data. 
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BACKGROUND OF THE INVENTION 

1 . Field of the invention 

[0001] The present invention generally relates to a digital watermarking technique. More particularly, the present 
invention relates to a digital watermarking technique for embedding or reading digital watermark data in digital data 
contents which represent an image or a sound. In addition, the present invention relates to a technique for statistical 
processing of read watermark data in a system using the digital watermarking technique. 

[0002] It is easy to replicate or tamper fraudulently with multimedia production, and the easiness hinders an data 
content provider from sending data. In addition, some users may not use the data originated from the provider validly. 
Therefore, copyright protection is strongly needed for the multimedia production. The digital watermarking technique 
is effective in realizing the copyright protection. According to the digital watermarking technique, sub-data is embedded 
in data contents without being noticed by a user by utilizing redundancy of data such as of an image and a sound. The 
digital watermarking technique is used for protecting a multimedia copyright by embedding copyright information, a 
user ID and the like as the sub-data in secret, since it is difficult to separate the sub-data from the data contents. 

2. Description of the Related Art 

[0003] Conventionally, the following digital watermarking techniques are proposed. 

[0004] According to a technique proposed in Japanese patent application No. 9-5751 6. "Image processing method 
and the apparatus." an image is subdivided into blocks larger than a 8X8 block size which is used for common non 
reversible compression. Then, the size of the frequency coefficient which is obtained by discrete Fourier transform of 
the block is changed, the frequency coefficient being represented by a polar coordinate system and the size being a 
distance from the origin point of the polar coordinate system. As a result, sub-data can be read correctly even when 
the non-reversible compression is performed. In addition, the frequency coefficient is normalized within a range of 
predetermined values, is embedded, and read. In addition, weaker image processing is carried out on a complicated 
region as compared to a flat region. As a result, degradation of image quality which may be caused by embedding the 
sub-data can be suppressed and a tolerance to contrast changing is obtained. Further, as the value of the frequency 
coefficient to be changed becomes larger, the modification amount of the frequency coefficient becomes larger (the 
smaller the value is. the smaller the modification amount is) so as to suppress the deterioration of image quality more 
effectively. In addition, when subdividing an image into blocks, an image area which is smaller than one block is treated 
as one block by using an average^pixel value and/or using a form symmetric with respect to a line repeatedly to com- 
pensate for the lacking image area. Moreover, the sub-data is constituted from the whole image after weighting data 
of each block. As a result, the sub-data is read correctly even when the image is partly edited and/or the image with 
many flat parts is non-reversibly compressed. 

[0005] In addition, according to a technique proposed in Japanese patent application f\Jo. 9-1 64466. "Information 
embedding method, data reading method and the apparatus." when embedding data into motion pictures, data em- 
bedding is carried out to components of a relatively low frequency region. Further, frequency conversion is carried out 
with a block size largerthan a b'ock size used for data compression, and. then data embedding is carried out. Moreover, 
an original image is used when data is read. As a result, tolerance to data compression is obtained. 
[0006] Other conventional techniques are proposed in Japanese patent applications No.8-305370. No. 8-338769. 
No. 9-9812. No. 9-14388. No. 9-109924. No. 9-197003. No. 9-218467 and No. 10-33239. The digital watermark method 
is also called data hiding, finger printing steganography, image/sound deep encryption and the like. 
[0007] Elements for determining performance of the digital watermarking technique are as follows: 

(1) quality of data contents in which the digital watermark is embedded; 

(2) durability of the digital watermark which is embedded in the data contents when the data' contents are manipu- 
lated; 

(3) safety against intentional erasing of and tampering with digital watermark data, and 

(4) reliability of the digital watermark data which is read from the data contents. 

[0008] The digital watermarking technique is broadly divided into two methods. One method of gives meaning to a 
data value by quantizing. For example, by dividing a data value by a quantization value and dividing the result by 2. a 
bit data can be represented by the remainder. Another method embeds digital watermark data by using a spread 
spectrum method. 

[0009] The above-mentioned examples are based on the former method. In terms of the method, there is a problem 
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with respect to the above element (1) in that the digital watermark data embedded in the data contents may be per- 
ceived, or commercial value of the data contents may be lost by embedding the digital watermark data. With respect 
to the above element (2). the digital watermark data which is embedded in the data contents may be dissipated even 
when a general user uses the data contents in a normal way. Particularly, it is a difficult problem to achieve both 

5 elements (1 ) and (2) with enough performance in practical use. 

[0010] In addition, there is a method of embedding the digital watermark data repeatedly in order to give durability 
to the digital watermark data against manipulation of the data contents. Specifically, according to the method, digital 
watermark data which is embedded repeatedly (which is called a watermark sequence hereinafter) is read from data 
contents, and. then, the digital watermark data is reconstituted by performing statistical processing. The watermark 

io sequence has durability against deterioration and noise to some extent. However, if the data contents are encoded by 
high compression rates, it may become difficult to read the watermark sequence from the data contents. Therefore, it 
may become impossible to reconstitute the digital watermark data. 

[0011] In addition, as for a digital watermarking system, accuracy for determining the presence or absence of em- 
bedded data is important. In addition, reliability of embedded data is important. The digital watermarking system gen- 
15 erally has a mechanism for reconstituting correct digital watermark data even when sub-data embedded in the data 
contents is corrupted to a certain extent, since the digital watermarking system assumes various processing on the 
watermarked data contents. However, under present circumstances, it is impossible for the system to evaluate validity 
of reconstituted digital watermark data quantitatively. Therefore, the system does not have enough reliability. 

20 SUMMARY OF THE INVENTION 

[0012] It is a first object of the present invention to tmprove quality of watermarked digital data contents and to improve 
durability of digital watermark data against media processing of the watermarked digital data contents. 
[0013] It is a second object of the present invention to evaluate quantitatively probabilities of cases that data contents 
25 which do not contain digital watermark data are wrongly judged as containing digital watermark data, and incorrect 
digital watermark data is read from watermarked digital data contents. 

[0014] It is a third object of the present invention to separate a digital watermark data sequence, when reading 
watermarked d igjtal data contents, from noise so that error bits which are included in the digital watermark data se- 
quence can be reduced, thereby watermark data reading success rate being improved in comparison with the conven- 
30 tional method without changing an amount of digital watermark data and a method of embedding the digital watermark 
data. 

[0015] The first object of the present invention is achieved by a method for embedding digital watermark data m 
digital data contents. The method includes the steps of: 

35 receiving the digital data contents and the digital watermark data; 

dividing the digital data contents into block data; 
obtaining a frequency coefficient of the block data; 
obtaining a complexity of the block data; 

obtaining an amount of transformation of the frequency coefficient from the complexity and the digital watermark 
40 data by using a quantization width; 

embedding the digital watermark data in said digital data contents by transforming the frequency coefficient by the 
amount; and 

generating watermarked digital data contents. 
45 [0016] The first object of the present invention is also achieved by a method including the steps of: 

receiving the digital data contents and the digital watermark data; 
dividing the digital data contents into block data; 
obtaining a frequency coefficient of the block data: 
50 obtaining an amount of transformation of the frequency coefficient from the digital watermark data by using a 

quantization width corresponding to the frequency coefficient, the quantization width being obtained beforehand 
according to a manipulation method of the digital data contents; 

embedding the digital watermark data in said digital data contents by transforming the frequency coefficient b y the 
amount; and 

55 generating watermarked digital data contents 

[0017] According to the above-mentioned inventions the amount of transformation of frequency coefficients is 
changed and/or the amount of transformation is increased or decreased according to the complexity of the digital data 
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contents. Therefore, the quality of the watermarked digital data contents can be improved and the durability of digital 
watermark data against a manipulation of the watermarked digital data contents can be improved. 
[0018] The second object of the present invention is achieved by a method for reading digital watermark data em- 
bedded in digital data contents, the method including the steps of: 

receiving the digital data contents; 

reading a bit sequence from the digital data contents; 

calculating a probability of reading a bit '1' or a bit '0* in the bit sequence by using a test method on the basis of 
binary distribution; 

determining the presence or absence of digital watermark data according to the probability; and 
reconstituting and generating the digital watermark data from the bit sequence. 

[0019] According to the above-mentioned invention, probabilities of the following cases can be evaluated quantita- 
tively. The cases are that digital data contents which do not contain digital watermark data are wrongly judged as 
containing digital watermark data, and incorrect digital watermark data is read from watermarked digital data contents. 
In addition, the probability can be suppressed within a constant value. 

[0020] The third object of the present invention is achieved by a method for reading digital watermark data from 
digital data contents in which each bit of digital' watermark data is embedded a plurality of times, the methoc : ncluding 
the steps of: 

receiving digital data contents; 

reading a digital watermark sequence from the digital data contents; 

performing soft decision in code theory by assigning weights to the digital watermark sequence with a weighting 
function; 

reconstituting the digital watermark data from the digital watermark sequence 

[0021] According to the above-mentioned invention, the digital watermark data sequence is separated from the noise 
so that error bits which are included in the digital watermark data sequence can be reduced, thereby the digital water- 
mark data reading success rate being improved in comparison with the conventional method. In addition, since weights 
are assigned to the digital watermark data sequence, the present invention is especially effective when the repeating 
number of watermark embedding is small. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0022] Other objects, features and advantages of the present invention will become more apparent from the following 
detailed description when read in conjunction with the accompanying drawings, in which: 



Fig. 1 is a block diagram of a digital watermarking system of the present invention; 

Fig. 2 is a general flowchart showing a digital watermark embedding process according to a conventional technique; 
Fig. 3 is a detailed flowchart showing a principal part of the digital watermark embedding process according to the 
conventional technique; 

Fig. 4 is a conceptual diagram of the digital watermark embedding process according to the conventional technique; 
Fig. 5 is a general flowchart showing a digital watermark reading process according to the conventional technique; 
Fig. 6 is a detailed flowchart showing a principal part of the digital watermark reading process according to the 
45 conventional technique; 

Fig. 7 is a block diagram showing receiving data and generating data of a digital watermark embedding apparatus 
of the present invention; 

Fig. 8 is a block diagram showing receiving data and generating data of a digital watermark reading apparatus of 
the present invention; 

Fig. 9 is a general flowchart showing a digital watermark embedding process according to a first embodiment of 
the present invention; 

Fig. 10 is a detailed flowchart showing a principal part of the digital watermark embedding process according to 
the first embodiment of the present invention: 

Figs. 11A and 113 are conceptual diagrams of the digital watermark embedding process according to the first 
embodiment of the present invention; 

Fig. 12 is a flowchart of a process for calculating a data complexity according to a second embodiment of the 
present invention: 

Fig. 13 is a flowchart showing a process for obtaining a watermark weight ratio data according to the present 
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invention; 

Figf 14 is a detailed flowchart showing a principal part of the digital watermark embedding process according to a 
third embodiment of the present invention; 

Fig. 15 is a detailed flowchart showing a principal part of the digital watermark reading process according to a fourth 
5 embodiment of the present invention; 

Fig. 16 is a flowchart showing a process of calculating a watermark strength matrix according to a fifth embodiment 

of the present invention; 

Fig. 17 is a block diagram of a computer; 

Fig. 18 is a block diagram of an integrated circuit; 
io Fig. 19 is a block diagram of a digital watermarking system of the present invention; 

Fig. 20 is a block diagram of a digital watermark reading apparatus shown in Fig. 19; 

Fig. 21 is a diagram for explaining judgment on digital watermark data; 

Fig. 22 is a conceptual diagram of reconstituting digital watermark data; 

Fig. 23 is a flowchart of a digital watermark reading process according to a seventh embodiment of the present 
?5 invention; 

Fig. 24 is a block diagram of a digital watermarking system according to an eighth embodiment of the present 
invention; 

Fig. 25 is a flowchart of a digital watermark reading process according to a tenth embodiment of the present inven- 
tion; 

20 Fig. 26 is a flowchart of a digital watermark reading process according to a tenth embodiment of the present inven- 

tion when reading digital watermark data sequence which is embedded after being modulated by a pseudo-random 
sequence; 

Fig. 27 is a diagram showing the result of reading a digital watermark data sequence without modulation; 
Fig. 28 is a diagram showing the result of reading a modulated digital watermark data sequence; 
25 Fig. 29 is a diagram showing a digital watermark reading process according to a conventional technique; 

Fig. 30 is a graph showing how MPEG-1 coding changes "1' data bit, specifically the graph shows occurrence 
frequency with respect to change amount of a DCT coefficient value by 1 .5-Mbps MPEG - 1 coding; 
Fig. 31 is a flowchart showing a principle of a thirteenth embodiment of the present invention corresponding to a 
third object; 

30 Fig. 32 is a block diagram of a digital watermark reading apparatus according to the thirteenth embodiment of the 

present invention; 

Fig. 33 is a general flowchart showing a digital watermark reading process according to the thirteenth embodiment 
of the present invention; 

Fig. 34 is a diagram showing the result of comparison of a digital watermark data reading success rate between a 
35 conventional reading method and the present invention. 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 



[0023] Before explaining embodiments of the present invention, definition of some words will be given. "Digital wa- 
termark data sequence" represents a data sequence read from the digital data contents before being reconstituted 
"Digital watermark data" represents significant data for system operation, which data needs to be embedded in the 
digital data contents, or, data obtained by reconstituting the digital watermark sequence. "Reliability (/. of digital water- 
mark" is an index representing validity of read digital watermark data. That is. it represents a probability that the read 
digital watermark data matches with the actual embedded digital watermark data. Conversely, a probability of reading 
45 digital watermark data from an image without digital watermark data or reading erroneous digital watermark data can 
be represented as 2(1-a). Similarly, "embedded sequence" represents data to be actually embedded. The embedded 
sequence includes sequence of embedded data which is modulated, extended or repeated. In addition, "read" may be 
replaced with "extract" in some cases. 

[0024] Fig. 1 shows a digital watermarking system of the present invention. In the system shown in Fig. 1, digital 
5v watermark data 101 is embedded in digital data contents 103 by a digital watermark embedding apparatus 102. then, 
converted into watermarked digital data contents 104. 

(0025] The watermarked digital data contents 104 are degraded to watermarked digital data contents 105 by com- 
pression or image processing while the watermarked digital data contents 1 04 are distributed by wireless or wire com- 
munication or by a packaged medium. 
55 [0026] A digital watermark reading apparatus 106 reads a watermark sequence from the degraded watermarked 
digital data contents 105. and reconstitutes digital watermark data 107. 

[0027] In the following, a digital watermark embedding method and a digital watermark reading method by using 
quantization will be described, since embodiments of the present invention are based on the methods. After the de- 
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scription of the methods, the embodiments of the present invention will be described. 

[0028] According to the digital watermarking technique based on quantization, digital watermark data is embedded 
by quantizing all or a part of data which is transformed (for example, by an orthogonal transform) from original digital 
data contents, or not-yet-transformed data. As for digital watermark data reading, data in the contents in which digital 
5 watermark data is embedded is quantized by the same value as a value used for embedding digital watermark data, 
and. then digital watermark data is determined from the quantized data value. 

[0029] In the following, a general outline of the methods will be described. The Japanese patent application No. 
9-57516. "Image processing method and the apparatus", and the Japanese patent application No. 9-164466, "Informa- 
tion embedding method, data reading method and the apparatus", and the like can be referred to for obtaining detailed 
10 information of the digital watermarking technique based on quantization. 

[0030] First, digital watermark embedding method based on quantization will be described. A process of the method 
is carried out by the digital watermark embedding apparatus 102 shown in Fig. 1. Fig. 2 is a flowchart showing the 
process. 

[0031] The digital watermark embedding apparatus 102 obtains block data 109 by dividing digital data contents 103 
is into a plurality of blocks (m blocks in this example) in step 100. Then, a frequency coefficient matrix 115(an orthogonal 

transform coefficient matrix) is generated by performing an orthogonal transform on the block data 108 in step 110. 

[0032] A pseudo-random sequence 1 25 is generated from input key data 12 in step 1 20. Then, a coefficient (for each 

block) from the coefficient matrix 115 is selected one by one using the pseudo-random sequence 125 so as to generate 

a frequency coefficient sequence 135 to be watermarked in step 130. Each bit of the digital watermark data 101 are 
20 diffused by repeating number (t) of embedding so that a digital watermark sequence 145 is generated in step 140. The 

digital watermark sequence 145 is embedded into the frequency coefficient sequence 135 such that a watermarked 

frequency coefficient sequence 155 is generated in step 150. 

[0033] After that, the frequency coefficient sequence 135 in the frequency coefficient matrix 115 is replaced by the 
watermarked frequency coefficient sequence 155 to generate a watermarked frequency coefficient matrix 165 in step 

25 1 60. Then, the watermarked frequency coefficient matrix 1 65 is inverse-orthogonal-transformed to form a watermarked 
block data 175 in step 170. After that, the block data part of the input digital data contents 103 is replaced by the 
watermarked block data 175 in step 180. As a result, a watermarked digital data contents is output. 
[0034] In the above description, data in which digital watermark data is embedded is assumed to be the coefficient 
of the frequency coefficient matrix. However, the data can be a pixel. In addition, when selecting a coefficient value 

30 from a block image, the number of the selected coefficient is not limited to one, that is, it can be more than one or zero. 
The present invention does not depend on the matter. 

[0035] In the process of diffusing the digital watermark data shown in Fig. 2. for example, a process represented by 
s[j][k]=w[j] is carried out for all j and k, then, the digital watermark data (w[0].w[1], .w[n-1]) is transformed to the digital 
watermark data sequence s[0][0].s[0][1]. ■ .s[0][t-1].s[1][0],s[1][1]. -.s(1][t-1].s[n-1][0].s[n-1][1], s[n-1][t-1]. 
35 [0036] In the watermark embedding process, quantization widths of frequency coefficients q[0].q[1];-.q[m-1] are 
used. 

[0037] In the following, the watermark embedding process (step 150) will be described in detail with reference to a 
flowchart shown in Fig. 3. 

[0038] Let the repeating number t of each bit of the digital watermark data be 



t = 



^5 



the frequency sequence be w[j],s[j][k]e{0. 1} {0-" j<n.0--" : k<t}. 

[0039] The watermark embedding process to the frequency sequence (f[i]) is carried out as follows. 
[0040] 1. Following steps are carried out for all i 
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[0041] 2. A watermarked frequency coefficient f[i] is obtained from the frequency coefficient f[i] according to following 
steps. 

[0042] i) If 
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is equal to s[X][Y]. 



[0043] ii) If 



15 



is different from s[X][Y] and 



30 is equal to 



[0044] iii) If 
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[0045] Here. X=i/t and Y=/modr. In addition. ->'i represents a maximum number which does not exceed x and x mod 
y represents the remainder of x divided by y. 

[0046] Fig. 4 shows the concept of the conventional watermark embedding process. As shown in the figure, digital 

'5 watermark data is embedded by changing a data value of watermarking area to a central value of the quantization width. 
[0047] Next, the digital watermark reading method based on quantization will be described. The process is carried 
out in the digital watermark reading apparatus 106. According to the digital watermark reading process, a digital wa- 
termark data sequence is read from watermarked contents, and. then digital watermark data is reconstituted by sta- 
tistically processing the digital watermark data sequence. 

20 [0048] Fig. 5 is a flowchart showing the conventional digital watermark reading process based on quantization. 

[0049] The digital watermark reading apparatus 106 obtains watermarked block data 205 by dividing watermarked 
digital data contents 105 into a plurality of blocks (m blocks in this example) in step 200. Then, a frequency coefficient 
matrix 215 is generated by performing an orthogonal transform on the watermarked block data 205 in step 210. 
[0050] A pseudo-random sequence 225 is generated from input key data 22 in step 220. Then, a coefficient value 

25 (for each block) of the frequency coefficient matrix 215 is selected one by one using the pseudo-random sequence 
225 so as to generate a watermarked frequency coefficient sequence 235 in step 230. Then, the watermark reading 
process is performed on the watermarked frequency sequence 235 so that a digital watermark sequence 245 is read 
in step 240. Finally, the original watermark data 107 is output by performing statistical processing on the digital water- 
mark data sequence in step 250. 

30 [0051] Next, the digital watermark reading process (step 240) will be described in detail with reference to a flowchart 
in Fig. 6. The process for reading the digital watermark sequence from the watermarked frequency coefficient sequence 
{f'[i]} is shown in the following. 
[0052] 1 . Following steps are carried out for all 



35 



i ( 0 s i < 



n ) 



40 



by using a frequency coefficient quantization width q. 

[0053] 2. The digital watermark sequence s[X][Y] is read from the frequency coefficient f [i]. That 



is. 



45 



q 2 



mod 2 



Here. 



60 



X = 



55 Y=/modr. 

[0054] When the digital watermark data is reconstituted from the digital watermark data sequence by performing 
statistical processing, a majority decision method such as 
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(0^j<n) 



5 



is used. 



W [0055] Next, the present invention corresponding to the first object will be described. 

[0056] Fig. 7 is a block diagram showing input data and output data of the digital watermark embedding apparatus 
of the present invention. The digital watermark embedding apparatus 102 inputs digital data contents 103 such as an 
image and a sound as main data, key data 12 and digital watermark data 101 as sub-data. The digital watermark 
embedding apparatus 102 embeds digital watermark data 101 into the digital data contents 103 and outputs water- 
's marked digital data contents 104. 

[0057] Fig. 8 is a block diagram showing input data and output data of the digital watermark reading apparatus of the 
present invention. The digital watermark reading apparatus 20 inputs the watermarked digital data contents 21 and 
the key data 22, and outputs digital watermark data 23 embedded in the watermarked digital data contents 21. Here, 
the key data 22 is the same as the key data 1 2. 
20 [0058] In the following, embodiments of the present invention will be described. 

(First Embodiment) 

[0059] A first embodiment of the present invention will be described. According to the first embodiment, digital data 
25 contents and digital watermark data is received and the digital data contents are divided into block data. Then, a 
frequency coefficient of the block data and the complexity is obtained. Next, an amount of transformation of the fre- 
quency coefficient is obtained from the complexity and the digital watermark data by using a quantization width, and 
the digital watermark data is embedded by transforming the frequency coefficient by the amount. Then, watermarked 
digital data contents is generated. 
30 [0060] The process of the first embodiment is a modified process of the digital watermark embedding process shown 
in Fig. 2, in which the modified part is the main part for watermark embedding. 

[0061] Fig. 9 is a flowchart of the whole process of the first embodiment. In Fig. 9. a step 190 which calculates the 
complexity and a watermark embedding process for varying the transformation amount according to the complexity 
(steps 195. 150) are different from the conventional process shown in Fig. 2. Therefore, the different point will be mainly 

35 described in the following. 

[0062] Block data 108 is input, and a complexity sequence 195 is generated by calculating a data complexity e[i] 
(0*?e[i]??1) for each block data in step 190. Then, the coefficient value of data to be watermarked is transformed to a 
value within quantization width according to the data complexity. In this embodiment, it is possible to use a conventional 
method for calculating the data complexity. For example, in the case of an image, a process for obtaining local image 

JO complexity can be used. In this case, it is necessary to normalize the range of the local complexity such that the range 
becomes from 0 to 1 , if the range is from -(/. to + |3. 

[0063] Next, the watermark embedding process which is the heart of the second embodiment will be described in 
detail. Fig. 10 is a flowchart showing the watermark embedding process (step 150 in Fig.9) in detail. 
[0064] The process for embedding digital watermark data into a frequency coefficient sequence {f[i]} of the first em- 
^5 bodiment is carried out as follows. 
[0065] 1. For all i 



50 



(0si< 



n 



55 



the following process is carried out. 

[0066] 2. A watermarked coefficient f [i] is obtained from a coefficient f[i]. 
[0067] i) If 
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mod 2 



is equal to s(X][Y]. 



xq 



[0068] ii) If 



is not equal to s[X][Y] and 



is equal to 



[0069] iii) If 



is not equal to s[X][Y] and 



is not equal to 



+ — 



mod 2 



1 2 



M 



f ' ( i] 



H 



c[i] 



<7 2 



mod 2 
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5 



f ' [ i] 




XCf . 



70 




Here, q represents the quantization width for digital watermark data embedding, X=i/j. Y=imodt, and i- v J is the maximum 
integer which does not exceed x. and x mod y represents the remainder of x divided by y. 

[0070] Figs. 11 A and 11 B are conceptual diagrams showing the digital watermark embedding process of the first 
'5 embodiment. As shown in Fig.11B. a data complexity etiKO^efiJ^I) is calculated for each block data. then, the value 
of data in which digital watermark data is embedded is transformed to a value within the quantization width according 
to the data complexity. 

[0071] Generally, the quality of the watermarked digital data contents is a trade-off for the strength of the digital 
watermark data. However, according to the present invention, both of the quality of the watermarked digital data con- 
20 tents and the watermark durability can be improved while keeping the quality and the durability in balance. That is, 
digital watermark data is embedded according to the local data complexity. 

Specifically, digital watermark data is embedded with a greater strength in a complex part., and is embedded with a 
weaker strength in a non-complex part. 

[0072] The watermarking technique has an embedding process and a reading process in a pair. However, even if 
25 the embedding process is modified from the conventional process according to the present invention, the reading 
process does not need to be changed from the conventional reading process for reading the digital watermark data 
which is embedded by the method of the present invention. (Second Embodiment) 

[0073] In the following, a second embodiment of the present invention will be described. The second embodiment 
relates to the process for calculating the data complexity (step 190 in Fig. 9). 
30 [0074] According to the second embodiment, the block data is transformed by applying Wavelet transform. Then, 
high frequency coefficient data of the wavelet transformed data is filtered by using a threshold value, and the complexity 
of the block data is calculated from the number of the data values which exceed the threshold. 

[0075] Fig. 12 is a flowchart of the process for calculating the data complexity according to the second embodiment. 
Here, let the dimension of a block data B[i] be N. and let the size of the block data be MqxM-, ■•■ xM N .-| . A following 
35 process is performed on each element h v0 v1 ... vN . 1 (0<v u <M L /2. 0<u<N) of the high frequency coefficient matrix 
'^ h n-i °f N dimensional wavelet transformed block data. 
[0076] 1. country^O 

[0077] 2. For all (v 0 .v 1l ---,v N . 1 . a step 3 is carried out (N dimensional loop). 

[0078] 3. For a threshold A?r0 which is set beforehand . If | hv 0 .v 1 .• ■■ ,v N _., | =rA. counts-count + 1 . Here. | x | represents 
40 the absolute value of x. 

[0079] 4. For a threshold which is set beforehand. 
If counter. e[i]*-1.0. If not. e[\]^-92^L. 

[0080] In the process for calculating the data complexity, for example, if it is assumed that N=2 (an image), the basis 
of the wavelet transform is the Haar basis. M 0 =16 and 1^ = 16. an experiment shows that values of A=4 and r=16 are 

^5 good for the balance for embedding digital watermark data without being notified by a human. 

[0081] According to the second embodiment, the above-mentioned function can be realized by setting the two tnresh- 
olds A and f according to the characteristics of the watermarking technique such as the kind of data to be watermarked, 
a unit (the size of the block data), the kind of orthogonal transform to be used. By applying the above-mentioned function 
to the watermarking technique, it becomes possible to embed digital watermark data more appropriately according to 

50 characteristics of individual digital data contents. 

(Third Embodiment) 

[0082] In the following, a fourth embodiment of the present invention will be described. 
55 [0083] According to the third embodiment, in the digital watermark embedding process, block data of digital data 
contents is obtained, and a transformation amount of frequency coefficient is calculated on the basis of a transformation 
amount for each frequency band according to a manipulation method of the digital data contents. Then, block data of 
the watermarked digital data contents is generated. 
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[0084] Let the dimension of a block data B[i] be N and the size be Mqa M |--aM n . v Here, a sequence which represents 
the ratio of transformation width for the frequency band of each frequency coefficient needs to be obtained beforehand 
by using adequate digital data contents before operating a digital watermarking system. The calculation method for 
obtaining q[i] will be described in detail in a fifth embodiment later. 

[0085] Fig. 13 is a flowchart for obtaining the ratio of the quantization with for each frequency band. First, digital data 
contents 1000 is input, and block data 1015 is obtained by dividing the input digital data contents into blocks in step 
1010. The block data 1015 is transformed to first frequency coefficient matrices by applying the orthogonal transform 
in step 1020. Next, digital data contents 1035 is generated by performing a manipulation such as non-reversible com- 
pression on the digital data contents 1000 in step 1030. Then, block data 1045 is generated by dividing the digital data 
contents 1035 into blocks in step 1040. Second frequency matrices 1055 is generated by applying the orthogonal 
transform to the block data 1015 in step 1055. Then, the variance of the distribution of the difference between each 
element of the frequency coefficients matrices 1025 and each element of the frequency coefficients matrices 1055 is 
obtained in step 1060. Finally, watermark weight ratio data d Vo« v i-""- v n-i ^070 which represents the ratio of transfor- 
mation for each frequency coefficient is obtained. The watermark weight ratio data obtained in this way is stored, and 
it is used in a watermark embedding process and a watermark reading process as necessary. The quantization width 
is obtained as d Vq.v^ •■,v N _ 1 ^Power which will be described next. 

[0086] Fig. 14 is a flowchart showing the watermark embedding process which is the heart of the third embodiment 
of the present invention. Here, the flow of the whole process is the same as that shown in Fig.2 or Fig. 9. 
[0087] Let the watermark weight ratio sequence be {dvQ.v-,.- .v,^} (0^v u <M u ,0^u<N), and let the watermark 
strength be Power (the watermark strength represents durability of digital watermark data against manipulations of 
watermarked digital data contents.) 

[0088] The watermark embedding process of the embodiment is carried out as follows. 
[0089] 1. For alii 



a following process is carried out. 

[0090] 2. A quantization width q[i] used when embedding digital watermark data into the frequency coefficient f[i] is 
obtained by q[i]^-d Vq.v-, . ' -.v^ ^ Power by using an element dv 0 ,v 1t • \v N _ 1 of the watermark weight ratio sequence 
which corresponds to the band of the frequency coefficient f[i] (f[i] is a (Vq^ -y N ^) th component of the frequency 
coefficient matrices). 



[0091] 3. The watermarked frequency coefficient f'[i] is obtained from the frequency coefficient f[i] in the following way. 
[0092] i) If 



( 0 £ i ■ < — - n ) , 



n 




is equal to s[X][Y], 




[0093] ii) If 




is not equal to s[X][Y] and 
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+ — 



is equal to 



w 



/to 



f ' (i| 



/[<] . I 



9 



+ 1 



20 [0094] iii) If 



25 



is not equal to s[X][Y] and 



/to . i 



mod 2 



30 



^1 2 



35 



40 



is not equal to 



45 



f ' [i] 



-1 



xq 



[']• 



[0095] Here, X = i/t. Y=i mod t. and is the maximum 

integer which does net exceed x, and x mod y represents the remainder of x divided by y. 
50 (Fourth Embodiment) 

[0096] In the following, a fourth embodiment of the present invention will be described. 

[0097] The fourth embodiment is a watermark reading process corresponding to the watermark embedding process 
of the third embodiment. According to the fourth embodiment, block data of watermarked digital data contents is ob- 
55 tained. and digital watermark data is read from frequency coefficients on the basis of transformation amount for each 
frequency band according to a manipulation method of the digital data contents. 

[0098] Fig. 15 is a flowchart showing the watermark reading process of the fourth embodiment of the present inven- 
tion. Here, as in the case of the third embodiment, let the watermark weight ratio sequence be {dv o .v 1 .--\v N _ 1 }(0"' v M <M n . 
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0'ru<N), and let the watermark strength be Power (the watermark strength represents durability of digital watermark 
data against manipulations such as non-reversible compression to watermarked digital data contents.) 
[0099] The process for reading the digital watermark data sequence from the watermarked frequency coefficient 
according to the fourth embodiment is carried out as follows. 
5 [0100] 1. For alii 



n ) 



a following process is carried out. 

[0101] 2. A quantization width q[i] used when reading digital watermark data from the frequency coefficient f[i] is 
obtained by q[i]^dv 0 .v 1 .- -,v N . 1 x Power by using an element dvQ.Vj.- -.v^-, of the watermark weight ratio sequence 
is which corresponds to the band of frequency coefficient f[i] (f[i] is a (v 0 , v 1 , ^n-t ) th component of the frequency coef- 
ficient matrices). 

[0102] 3. The digital watermark data sequence s[X][Y] is read from the watermarked frequency coefficient f'[i] in the 
following way. 



20 



My}- 



/[.] + i 



mod 2 



25 [0103] Here. 



30 



x- l - 

t 



and Y=/modf. 

[0104] According to the above-mentioned third and fourth embodiment, the watermark embedding strength can be 
changed according to the frequency band. Specifically, the watermark embedding and reading method applicable for 

35 a manipulation method becomes possible. In the method, according to the amount of change of digital data contents 
from original data for each frequency band due to manipulation such as non-reversible compression, the watermark 
strength is raised to a band when the amount is large, and the watermark strength is reduced when the amount is 
small. Accordingly, both of the quality of watermarked digital data contents and the durability of digital watermark data 
can be improved at a time. (Fifth Embodiment) 

-to [0105] In the following, a fifth embodiment of the present invention will be described. According to the fifth embodi- 
ment, a number of digital data contents (images, sounds and the like) are prepared and calculation of a watermark 
strength matrix is carried out for each frequency band. 

[0106] A processing flow of the fifth embodiment is the same as that shown in Fig. 1 3 basically. Here, the orthogonal 
transform process shown in Fig. 13 is the same as an orthogonal transform process used for digital watermarking 
45 process. For example, if the orthogonal transform used for digital watermarking is discrete cosine transformation (DCT) 
for a 16X16 size, the DCT is used, and, if the transformation used for digital watermarking is fast Fourier transform 
(FFT) for an 128X128 size, the FFT is used. 

[0107] Fig. 16 is a flowchart showing the process of calculating the watermark strength matrix for each frequency 
band according to the fifth embodiment. 
50 [0108] Here, let the frequency coefficient matrices be N, the size be Mq^M^- -aM^, and each of the components 
be x0 VQ V1 ... VN r xt VQ V1 ... VN 1 (0-^v u <M u . 0^ u<N). The process shown in Fig. 16 is as follows. 
[0109] 1. For all i (0^i<m). the following steps 2 and 3 are performed. 

[0110] 2. F^r all (vq.v, \> M ) =(0.0 Q)-(M 0 . M ] M M . } ). the process of the following step 3 is performed. 

[0111] 3. y *-xo VQV ^ ^ - xt VQVl ,,. M 

55 [0112] 4. FoT'a 1 !! (vffv\ \ N . t )=(0.0 0)-(M o .M, M N . t ). the following steps 5 and 6 are performed. 

[0113] 5. 
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Art 



[0114] 6. 




[0115] According to the fifth embodiment, it becomes possible to set the watermark strength to a level that is suitable 
for each frequency band according to a manipulation method for digital data contents such as non-reversible compres- 
sion. For example, if the watermark strength is Power and the distribution of the amount of change of each coefficient 
value of the frequency coefficients after a manipulation can be approximated by a Laplacian distribution, when digital 
watermark data is read from digital data contents on which an assumed manipulation is performed, the rate of bit 
reversal for the extracted digital watermark data can be made constant 



Power 




regardless of the frequency band (e is the natural logarithm). It is the advantage of the present invention to be able to 
predict the rate of bit reversal with the constant formula. In addition, according to the embodiment of the present 
invention, one of the problem of the conventional method that durability of embedded digital watermark data is varied 
according to the position of the frequency coefficient is solved. That is. the durability of the embedded digital watermark 
data is constant regardless of the position of the frequency coefficient (which is obvious from the above formula). The 
embodiment can be applied not only to the watermarking method based on quantization but also to a watermarking 
method based on the spread spectrum technique. 



(Sixth Embodiment) 



[0116] In the following, a sixth embodiment of the present invention will be described. According to the sixth embod- 
iment, the digital watermark embedding process is carried outby utilizing the first embodiment and the third embodiment 
in combination. The watermark embedding process of the sixth embodiment will be described as a modification of the 
step 150 shown in Fig. 9. 

[0117] According to the sixth embodiment, the process for embedding digital watermark data in a frequency coefficient 
sequence {f[i]} is as follows. 
[0118] 1. For all 



i ( 0 =s i" < 



n ) 



the following is performed. 

[0119] 2. A quantization width q[i) used when embedding digital watermark data into the frequency coefficient f[i] is 
obtained by q[i]<— dVQ.v^-'-.Vj^,^ a Power by using an element dv 0 ,v 1 .•■■.v N . 1 of the watermark weight ratio sequence 
which corresponds to the band of the frequency coefficient f[i]. 

[0120] 3. The watermarked coefficient f[i] is obtained from the frequency coefficient f(i] in the following way. 
[0121] i) If 



15 



EP 0 984 616 A2 



mod 2 



is equal to s(X][Y], 



f ' [i]— f [i] + 



f[i] + 1 

^ 2 



*q[i\- 



[0122] ii) If 



/['] , 1 



+ — 

1 



mod 2 



is not equal to s[X][Y] and 



/[;]. ' 

Wi 2 



is equal to 



[0123] Hi) If 



f ' [ i 



e[i] + r 



/[<] , 1 



mod 2 



is not equal to s[X](Y] and 



57] = 



16 



is not equal to 
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5 




10 



f ' [i] 



[W 2 J 2 J 



f[i] + l e[i) + l \ 



Here. X=i/t. Y=i mod t. and '.'J is the maximum 

15 integer which does not exceed x, and x mod y represents the remainder of x divided by y. 

[0124] According to the sixth embodiment, both of the quality of the watermarked digital data contents and the du- 
rability of digital watermark data can be further improved as compared with the first embodiment and the third embod- 
iment applied separately. The watermark reading method shown in the fourth embodiment can be used as-is for a 
watermark reading method in the sixth embodiment. 

20 [0125] The above-mentioned processes performed by the digital watermark reading apparatus and the digital wa- 
termark embedding apparatus according to the present invention can be constructed by a program which can be stored 
in a computer readable medium such as a disk unit, a floppy disk, CD-ROM and the like. Then, by installing the program 
into a computer from the medium, the present invention can be easily realized. Fig. 17 is a diagram showing the con- 
figuration example of the computer. As shown in the figure, the computer includes a CPU300. a memory 301, an 

25 external storage unit 302. a display 303, a keyboard 304 and a communication processing unit 305. The digital water- 
marking process of the present invention is carried out by running the program stored in the memory 301 on the CPU 
300. 

[0126] In addition, the digital watermark reading apparatus and the digital watermark embedding apparatus can be 
realized also by an integrated circuit shown in Fig. 18. The integrated circuit includes a memory part 401. a micro 

30 processor part 402. an interface part 403 managing an interface to an outside part. Since, the configuration in Fig. 18 
shows principal parts, the integrated circuit may includes other parts. The program stored in the memory part 401 is 
carried out by a micro processor part 402. The integrated circuit can take various other configurations. The integrated 
circuit can be incorporated to various apparatuses such as a camera so that the apparatuses can perform the digital 
watermarking process of the present invention. 

35 [0127] As mentioned above, according to the present invention, the rate of the amount of change of frequency co- 
efficients is changed, and/or. the amount of change of rate is increased or decreased according to the complexity of 
the digital data contents. Therefore, the quality of the watermarked digital data contents can be improved and the 
durability of digital watermark data against a manipulation of the watermarked digital data contents can be improved. 
[0128] Next, embodiments of the present invention corresponding to the second objectives will be described. 



(Seventh Embodiment) 

[0129] In the following, the seventh embodiment of the present invention will be described with reference to figures. 
[0130] Fig. 19 is a block diagram of a digital watermarking system to which the present invention relates. Fig. 19 

45 shows a similar configuration to that shown in Fig. 1. The difference is that Fig. 19 shows a digital watermark data 
reconstitution apparatus 108 which is an essential part of the present invention. The digital watermark data reconsti- 
tution apparatus 108 is provided in the watermark embedding apparatus 106. In the system, a digital watermark data 
sequence is read from the watermarked digital data contents 105 by using the watermark reading apparatus 106. Then, 
the digital watermark data sequence is processed in the digital watermark data reconstitution apparatus 108 so that 

50 the read digital watermark data 107 is obtained. 

[0131] In the following, the process for reconstituting the digital watermark data is described in detail. 
[0132] Fig. 20 is a block diagram of the watermark reading apparatus 106. The digital watermark data reconstitution 
apparatus 108 provided in the watermark reading apparatus 106 obtains the probability q that bit 1 is read when any 
1 bit watermark sequence is read from a whole watermark area beforehand by using the watermark reading apparatus 



[0133] Specifically, assuming a 1 bit watermark sequence reading part 501. the part 501 reads the watermark se- 
quence 1 bit by 1 bit from all elements of the whole watermark area (a broken line L1 ). and calculates the ratio of the 
number of bit 1 to the number of all trials. 



40 



55 



106. 
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[0134] In the embodiment, the reading probability of bit 1 and the number of bit 1 are obtained. However, it is possible 
that the reading probability of bit 0 and the number of bit 0 are obtained. Basically, there is no difference between the 
former and the latter. The difference is only on implementation. 

[0135] Accordingly, the probabilities of detecting bit 0 and 1 when reading 1 bit at random in the watermark area by 
using the digital watermarking algorithm is calculated to be 1-q and q respectively. 

[0136] The n bit watermark sequence reading part 502 reads the digital watermark data sequence from the water- 
marked digital data contents for the number of total times of embedding digital watermark data. 

[0137] Here, digital watermark data is defined as b 0 , b 1 b m . 1t bj <E{0, 1}. i<m (m bit length), the repeating number 

of embedding ith bit of the digital watermark data in the digital data contents is defined as r\, the read watermark 
sequence is defined as 

bo 0 . b 01 , ...b' 0in<M , b' 10 , bVi. ••• b'., nV1 b' m .^ 0 . b' m . 1ilt ... b' nv1 nnv1 b Lj e{0. 1} 



n r bit length). 

[0138] The data reconstitution apparatus 108 receives a subsequence of the digital watermark data sequence one 
after another from a subsequence corresponding to 0th digital watermark data to a subsequence corresponding to (m- 
1)th digital watermark data (a solid line L2). 

[0139] Next, the method for reconstituting ith bit of the digital watermark data will be described concretely. 

[0140] When nj bits of digital watermark data sequence is read at random from the watermark area, the probability 

P(x=k) of k T bits appearing in the n } bit sequence is represented by the binary distribution density function 

P(x = k)=n j C k q k (1-q) ni - k (1) 

and the distribution function of that. F(x). is 

F(x) = n i C ^'(^q) ni 'M0^xSnJ. (2) 

Here. njC k is the number of combinations when selecting k out of n^ 

[0141] Setting a reliability threshold value (/. (1/2<c/.^1) of the digital watermark data, the number of bit 1 included in 
a subsequence b'jo- b' L1 . ■ ■■b" \ nM corresponding to ith digital watermark data is calculated by 

*i - ST- b '" ■ 

Then, digital watermark data is determined in the following way by using the formula (2): 

r 0 when O^FfkJ^l - a 

b t = I 1 when a^F(kJ^l (3) 

L unknown or when l-a< F(k t )<a 

not present 

[0142] Viewing from a different angle, when determining by the number of bit 1 included in the watermark sequencen 
nj, if the largest integer x 0 that satisfies 0r-F(x = x 0 ) 1-u and the smallest integer x1 that satisfies <// F(x = x^ •■■"1 
are assumed to be threshold values, the digital watermark data is judged as shown in Fig. 21 such that if the number 
of 1 in nj is equal to or smaller than x 0 . the digital watermark data is 0. and that if the number of 1 is equal to or larger 
than x v the digital watermark data is 1. 
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[0143] The horizontal axis of Fig. 21 represents the number of bit 1 included in the watermark sequence, and the 
vertical axis represents frequency of the corresponding number. As for unwatermarked digital data contents, the fre- 
quency that bit 1 appears in a bit sequence read at random from the digital data contents becomes binary distribution. 
Thus, the peak of the frequency is at the half point of the number of bits. On the other hand, as for watermarked 
contents, in the subsequence nj in which bit 0 is embedded as digital watermark data, the frequency of bit 1 is 0 if there 
is no degradation and it is a small number which is equal to or smaller than Xq even if there is degradation. In the 
subsequence nj in which bit 1 is embedded as digital watermark data, the frequency of bit 1 is n1 if there is no degra- 
dation and it is a large number which is equal to or larger than x 1 evenif there is degradation. In this way. the distribution 
of the frequency of bit 1 or bit 0 in the watermarked sequence is leaning to one side from the center of the binary 
distribution. The present invention uses the lean for reconstituting digital watermark data from the read watermark 
sequence. 

[0144] Depending on a watermarking system, a following method can be used. That is. reconstituted digital water- 
mark data is obtained by using the bias from the central value of the distribution P(x) of the watermark sequence 
extracted from digital data contents 105. Next, the probability of appearing the read watermark sequence is calculated 
by the formula (2). Then, if the reconstituted digital watermark data is 1 . F(kj) can be added to watermark dada as the 
reliability, and, if the reconstituted digital watermark data is 0, 1-F(kj) can be added. The reliability F(kj) and 1 -F(kj) of 
the digital watermark data is obtained from the bias of appearance probability of the digital watermark data in the binary 
distribution of appearance probability of each bit of 1 bit sequence extracted at random from digital data contents. 
[0145] Fig. 22 shows a concept in which the length of the digital watermark data is extended to m bits. 
[0146] The digital watermark data reconstitution apparatus 108 outputs the reconstituted digital watermark data b 0 . 
b-i, b m .-, as read digital watermark data 107. 

[0147] Fig. 23 is a flowchart showing the above-mentioned process. The process will be described in the following 
with reference to Fig. 23. 

[0148] Watermarked digital data contents 105 and key data which is necessary for reading digital watermark data is 
input, and a digital watermark data sequence is extracted with respect to each bit value in step 1. Then, a threshold 
value (/. of the reliability is set in step 2. and a probability q that bit 1 appears when 1 bit of digital watermark data is 
read at random from the whole watermark area is obtained in step 3. Then, a binary distribution function F(x) which 
represents probability that x bits of 1 are included in the bit sequence is obtained from the probability q and the repeating 
number n t of each bit of digital watermark data in step 4. 

[0149] Then, 0 is assigned to i which distinguish a subsequence of the digital watermark data sequence in step 5. 
Next, the number of bit 1 in the subsequence is obtained as 



and the appearance probability F(kj) is obtained, then it is determined whether F(k i ) is equal to or less than 1-u in step 
6. If F(kj)^1-u. the digital watermark data w' t is reconstituted as 0 in step 7. Then, i is incremented by 1 in step 8. and 
the process goes back to step 6 if i<m in step 9. If F(kj):M-(/ is not true in step 6. it is checked whether F(kjp-</. is true 
in step 10. If F(kj)5r/., the digital watermark data Wj is reconstituted as 1 in step 11 . and the process goes to step 8. If 
F(kj)^<v. is not true in step 10, the process ends by determining as there is no watermark or the presence or absence 
is unknown in step 12. If i is more than n t in step 9, a reconstituted watermark sequence {w* } is output. In the above 
process, the reading process in step 1 can be carried out between step 4 and step 5. In step 6. it is checked whether 
1-F(kj) is more than a. 

[0150] In the seventh embodiment, it is assumed that there is no bias in the distribution represented by formula (1). 
that is. q=1/2. 

[0151] When the embedding number n s of each bit of digital watermark data is adequate for obtaining a statistical 
characteristic, it becomes q - 1/2 generally. However, since the value of q depends on characteristics of an watermark- 
ing algorithm and digital data contents, q may take a value deviating largely from 1/2 in some rare cases. A method 
for solving this problem will be described in a eighth embodiment. 

(Eighth Embodiment) 

[01 52] In the following, the eighth embodiment will be described. Fig. 24 is a block diagram of a watermarking system 
of the ninth embodiment. 

[0153] The watermark embedding apparatus 102 embeds digital watermark data 101 in digital data contents 103. 
At the time, when embedding each bit value r\ t times repeatedly, watermark sequence is modulated and embedded in 
the digital data contents 103. The modulation is carn-'i '.ut by a pseudo-random sequence generator (A) 601 which 




b ' 



i . r 
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is provided in the watermark embedding apparatus 102. 

[0154] For example, when assuming the embedding sequence as b 0 0 , b 0 A ...b 0 n0 .v b i.o- b i-v •• b i.m-i b m-i o- 

b m-i i- ■•■ b m-i nm-1 -1 b i j 1 )- anci tne pseudo-random sequence as r j0 . r s -,. ...r jnM b s j e{0, 1}. the embedding se- 
quence is modulated to 

5 

m i0 . m i1( ...m inM 
m u = b,j (+) r u 

by the pseudo-random sequence. A(+)B represents XOR of A and B. 
w [0155] According to the above-mentioned process, the same pseudo-random sequence is necessary for digital wa- 
termark data reading. 

[0156] For example, if 1 bit watermark sequence is read by using an M-sequence as the pseudo-random sequence, 
it becomes q=1/2. Therefore, the present invention can be applicable without depending on the watermarking algorithm 
and digital data contents. 

'5 [0157] When digital watermark data reading, demodulation is carried out as b'jj = rrij j (+) r ri by using a pseudo- 
random sequence generator (B) 602 which is provided in the watermark reading apparatus 106. 

[0158] Here, the pseudo-random sequence generator (A) 601 and the pseudo-random sequence generator (B) 602 

needs to be implemented such that both of the generators generate the same pseudo-random sequence. 

[0159] Watermark data is reconstituted with the method of the seventh embodiment from the watermark sequence 

20 b o.o> b 'o.i- - b 'o.no-i. b 'i.o- b Yi< - b Ym-i b m-i.o- b m-i.T ■ b m-i.nm-i -1 b i.j ^{0, 1} obtained by the demodulation. 

[0160] Since it-is considered that the appearance probability q of bit 1 in the watermark sequence can be approxi- 
mated by the binary distribution regardless of the presence or absence of modulation, there is no influence on the 
distribution of the density function (1) due to the modulation shown in this embodiment. 

[0161] In addition, q=1/2 can be assumed in implementation, that is, no process is necessary for obtaining q. There- 
25 fore, the amount of processing that is required for watermark reconstitution thus becomes the same as that for majority 
decision processing. Thus, the reconstitution process becomes faster. 

(Ninth Embodiment) 

30 [0162] In the following, a ninth embodiment will be described. In the ninth embodiment, an example will be described 
showing concrete values on the basis of the seventh embodiment and the eighth embodiment. In this embodiment, it 
is assumed that digital watermark data is 1 bit. the repeating number n of embedding is 127 and the probability q that 
bit 1 is read when reading 1 bit watermark sequence at random from the whole watermark area is 1/2. If the threshold 
value it is 0.99999 (which means 99.999%), x 0 in Fig. 21 is 36 and x-, is 90. That is to say. according to the present 

35 invention, under the above-mentioned condition, digital watermark data is judged as bit 0 if the number of 'V appeared 
in the watermark sequence (n bits) is equal to or less than 36, and it is judged as bit 1 if the number of '1 ' appeared in 
the watermark sequence (n bits) is equal to or more than 90. and it is judged that there is no watermark data or the 
presence or absence is unknown in other cases. If it is judged that there is digital watermark data, the correctness of 
more than 99.999% can be ensured. 

40 

(Tenth Embodiment) 

[0163] A tenth embodiment will be described in the following. According to the embodiment shown in Fig. 23. as is 

understood from the above-mentioned procedure, if even the reliability of only 1 bit is not obtained, that is. if F(k,) or 
45 1-F(kj) is less than a. the reconstitution of the digital watermark data becomes impossible because it is judged that 

there is no digital watermark data or the presence or absence is unknown. The tenth embodiment solves the problem. 

In this case, it is assumed that the repeating number of embedding each bit of digital watermark data is the same value n. 

[0164] The method for reconstituting digital watermark data w 0 . w 1 w m _., from watermark sequence b' D 0 . b' 0 v ... 

b 'o no-i ■ b i o- b 'i 1 • ■■■ b 'i n-1 b m o- b 'm 1- - b m n-1 which is read from digital data contents will be described in the 

50 following with reference to Fig. 25. 

[0165] The watermark sequence is read with respect to each bit value from the digital data contents and key data 

necessary for digital watermark data reading in step 1. 

[0166] The threshold value a (1/2<(/.-- r 1 ) of the reliability is set in step 2. For example, if the reliability of read digital 
watermark data needs to be equal to or more than 99"I.. it is set as </.=0.99. 
55 [0167] The probability q of bit 'V when 1 bit of the watermark sequence is read at random from the whole watermark 
area of watermarked digital data contents is obtained beforehand in step 3. The appearance probabilities of bits '0' 
and T are calculated as 1-q and q respectively. 

[0168] The probability that x bits of T are include-'.: rhe watermark sequence of each bit data of digital watermark 



20 



EP 0 984 616 A2 



10 



data are obtained as 



by using the binary distribution function in step 4. 

[0169] It is checked in step 5 that the probability that n bit digital watermark data sequence is digital watermark data 
exceeds the threshold value ex. Specifically, it is checked whether the following formula (4) is satisfied. 
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Here, |a| represents the absolute value of a. 



2"-.' b ^- n / 2 



25 represents the bias from the center of the binary distribution of the number of bit 'V in the n bit watermark sequence. 



m-l n - ! 

...» ° f b >.i" n / 2 



divided by m represents the average for the m bits of the whole digital watermark data, n/2 represents the center of 
the binary distribution. 

[0170] If the formula (4) is true, itis judged that there is digital watermark data. Thus, in each n bit watermark sequence 
of m digital watermark data sequences, digital watermark data is reconstituted by a majority decision processing. 
35 [0171] Specifically, if it is judged that there is digital watermark data, digital watermark data is reconstituted in the 
following way in step 6. 
[0172] For all i (0^i<m). when 
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when 
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w '. = 1 
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This process is carried out by steps 6-1 - 6-7 in Fig. 25. If 
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it is judged that there is no watermark data or the presence or absence is unknown. A following formula (5) can be 
used instead of the formula (4). 
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If the formula (5) is not true, it is judged that there is no watermark data or the presence or absence unknown. 
[0173] According to the tenth embodiment, statistical processing for whole watermark sequence is carried out so as 
to judge the presence or absence of watermark by using the formula (4) or the formula (5). If it is judged that there is 
15 digital watermark data, the reconstitution is carried out by the majority decision processing. Therefore, even if there is 
one bit of low reliability, digital watermark data can be reconstituted. 
[0174] In Fig. 25, the step 1 can be carried out between the steps 4 and 5. 

[0175] The tenth embodiment may use the pseudo-random sequence which is described in the eighth embodiment. 
Specifically, watermark embedding is carried out by modulating digital watermark data sequence with the pseudo- 

20 random sequence. When reconstituting, the read digital watermark data sequence is demodulated by the pseudo- 
random sequence, then the judgment by the formula (4) is performed . If the result is more than t/. and there is digital 
watermark data, the reconstitution process of the majority decision is performed on the demodulated sequence, which 
is the same process as the step 6 of the eleventh embodiment. The whole process is shown in Fig. 26, adding the same 
reference symbol to the corresponding part shown in Fig. 25. In the example, the pseudo-random sequence {r^} is 

25 generated from key data 'Key' and the process goes to step 2 in step 8. Next to step 4, watermark sequence is de- 
modulated with the pseudo-random sequence {r j;j } in step 9. The watermark bit b'j j in the formula (4) in step 5 is a bit 
which is demodulated in step 9. Also, the majority decision processing in step 6 is performed on bg. 

(Eleventh Embodiment) 

30 

[0176] In the following, a eleventh embodiment will be described. 

[0177] Since digital watermark data is dispersed by the pseudo-random sequence, when q is approximated to 1/2. 
the presence or the absence of watermark data in the watermark sequence can be judged as follows. 
[0178] The probability that x bits of T (a number x of T bits) are included in the n bit watermark sequence which 
35 constitutes each bit of digital watermark data is represented as 



40 



by using the binary distribution function. 

Accordingly, by obtaining the smallest integer x 1 which satisfies F(x=x 1 ) : ? > a, the demodulated sequence of the step 5 
in the tenth embodiment can be judged with the following formula (6). 
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In this case, the amount of processing can be reduced to the same level as that of the majority decision processing. 
[0179] The judgment is equivalent to a judgment for judging whether the average of the bias from the center n/2 of 
the binary distribution of the watermark sequence is equal to or more than x v 

[0180] If the formula (6) is true and it is judged that there is digital watermark data, the majority decision process is 
performed on the watermark sequence which is demodulated by the pseudo-random sequence in the following way. 
-[0181] For all i (0^i<m). when 



22 



EP 0 984 616 A2 



w 



2"-'. b '^< n / 2 = W l = 0. 



when 



w ' . = I 



?5 



20 



Then, the digital watermark data is reconstituted. 
[0182] If 
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it is judged that there is no watermark data or the presence or absence is unknown. 

[0183] In the above process, it is possible to use the maximum integer Xq which satisfies F(x=x 0 ) ; '-* instead of 
the minimum integer x 1 which satisfies F(x=x 1 )=:;(/.. In this case, a formula for judging the presence or absence of 
watermark is shown below as a formula (7). 
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(7) 



35 



If the left part of the formula is more than Xq, it is judged that there is no watermark data or the presence or absence 
is unknown. 



(Twelfth Embodiment) 

[0184] In the following, a twelfth embodiment of the present invention will be described. 
•to [0185] When it is judged that there is digital watermark data by the formula (4), the digital watermark data is recon- 
stituted by the above-mentioned majority decision process. At the same time, the reliability of the reconstituted water- 
mark sequence as a whole is calculated as 
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and it is output. 

[0186] Similarly, when it is judged that there is digital watermark data by the formula (5) and the digital watermark 
data is reconstituted, the reliability of the reconstituted digital waterma data sequence as a whole is calculated as 
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10 and it is output. 

[0187] When it is judged that there is digital watermark data by the formula (6), the digital watermark data is recon- 
stituted by the above-mentioned majority decision process. At the same time, the reliability of the reconstituted water- 
mark sequence as a whole is calculated as 



15 



20 



( m-\ n-1 



m 



n 

+ — 

2 



25 



and it is output. 

[0188] Similarly, when it is judged that there is digital watermark data by the formula (7), the reliability of the digital 
watermark data as a whole is calculated as 
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and it is output. 

[0189] In the above-mentioned seventh - twelfth embodiments, the reading probability of bit 1 and the number of bit 
1 are obtained. However, it is possible that the reading probability of bit 0 and the number of bit 0 are obtained. Basically, 
there is no difference between the former and the latter. The difference is only on implementation. 
[0190] In the following, examples of experiments will be shown. In the following experiments, an image of "lena" 
which has 1 28X128 pixels is used as a test image, and the threshold valuer/, of the reliability is assumed to be 0.999999. 

(First Experiment) 

[0191] In this experiment, 1 bit digital watermark data '1 ' was embedded 127 times repeatedly using key data '50,000'. 
and the watermark sequence was read with various key data. Fig. 27 shows the number of bit T in the read watermark 
sequence corresponding to the key data. In Fig. 27. the vertical axis shows the number of bit '1' in the read watermark 
sequence, and the horizontal axis shows the key data value. In this experiment, the appearance frequency of bit '1' in 
the watermark area A was q=0. 492247. 

[0192] When correct key data (50.000) is used, it is judged that digital watermark data is '1' with 99.9999% correctness 
since the number of bit '1' is more than the threshold value x 1 forjudging the presence of watermark. When incorrect 
key data is used, it is judged that there is no watermark data or the presence or absence is unknown. 

(Second Experiment) 

[0193] In the second experiment, a watermark sequence which was modulated with a 7 stage M-sequence (initial 
state is 64) was embedded, and a similar experiment as the first experiment was carried out with various key data and 
M-sequences of various initial states. The result is shown in Fig. 28. By carrying out the modulation, the value of q 
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becomes 0.500000 from 0.492247. and the variance becomes 31 .71 8777 from 31 .008265. Thus, the values are almost 
not changed from those of the first experiment. It is only when correct key data and correct pseudo-random sequence 
are used that digital watermark data can be read. In addition, when the watermark sequence is embedded in half data 
of the watermark area A, q=0. 741547 with the modulation and q=0. 499768 without the modulation. 
[0194] The effects of the present invention corresponding to the second object is as follows. 

(1) There are following effects by judging digital watermark data on the basis of the binary distribution in statistics: 
- The probabilities of following cases can be evaluated quantitatively. The cases are: digital data contents which 
do not contain digital watermark data are wrongly judged as containing digital watermark data, and incorrect digital 
watermark data is read from watermarked digital data contents. In addition, the probability can be suppressed 
within 2(1 -a) by using the reliability threshold u of digital watermark data. 

(2) There are following effects by modulating digital watermark data by a pseudo-random sequence before em- 
bedding the digital watermark data: 

The bias of the probability q of reading bit 'V when 1 bit watermark sequence is read at random from the whole 
watermark area. 

!t becomes difficult to detect the presence or absence of watermark data and the value from the bias of q 
without the correct key data and the pseudo-random sequence, the key data being necessary for reading 
digital watermark data and the pseudo-random sequence being necessary for demodulating read watermark 
sequence. It can strengthen security which is an important element for the digital watermarking system. 
In an implementation, since it can be assumed to be q=1/2, the amount of processing that is required for 
watermark reconstitution becomes the same as that for majority decision processing. Thus, the speed of the 
processing becomes higher. 

u is an index which represents a lower limit of the correctness rate of read digital watermark data, and is 
manageable in the digital watermarking system. Therefor, the method of using a is superior to a conventional 
method of showing the correctness rate of read digital watermark data to a user. 

[0195] According to the seventh embodiment, if there is even one bit of low reliability in digital watermark data {w'j}. 
it is judged that there is no watermark data or the presence or absence is unknown. However, even in the case, ac- 
cording to the eleventh - thirteenth embodiments, the reliability of digital watermark data can be evaluated quantitatively, 
the probability for reading digital watermark data incorrectly can be suppressed within 2(1 -a), and the digital watermark 
data can be reconstituted. In the tenth - twelfth embodiments, the whole digital watermark data is statistically processed 
by modifying the formula forjudging the presence or absence of digital watermark data, since digital watermark data 
can be reconstituted in many cases, when watermark sequence {b' j j}. (0^i<m,0^j<n) is seen statistically as a whole. 
[0196] In addition, according to the seventh embodiment. 



needs to be calculated with the distribution function F(x) of the binary distribution for all i to reconstitute digital watermark 
data from watermark sequence. On the other hand, according to the tenth - twelfth embodiments, only one calculation 
using the distribution function is necessary so that the amount of processing can be reduced. 

[0197] The present invention becomes more effective in combination with an error correction code. That is. when a 
part of bits in digital watermark data is intensively corrupted, it is judged that only the part of bits is unknown and other 
bit data is in high correctness rate. 

Therefore, correct data can be read by correcting only the corrupted bit data. 

[0198] In the present invention corresponding to the second object, the above-mentioned processes can be consti- 
tuted by programs which can be stored in a computer readable medium. Therefore, digital watermarking processing 
of the present invention can be carried out with a computer such as one shown in Fig. 1 7. Additionally, the watermarking 
apparatus of the present invention can be realized by an integrated circuit such as one shown in Fig. 18. 
[0199] In the following, the present invention corresponding to the third object will be described. 
[0200] In the first place, the conventional digital watermark reading method will be further described in order to clarify 
the feature of the present invention corresponding to the first objective. The conventional method is based on hard 
decisions on binary coding in code theory, which is shown in Fig. 29. With respect to the watermark reading method 
based on hard decisions on binary coding, if almost all watermarked contents are embedded within a same range 
(shown as the diagonally shaded area), the performance is enough. 
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[0201] However, according to the conventional watermark reading method, there is a following problem. Fig. 30 shows 
a graph showing how MPEG-1 coding changes 'V data bit. specifically the graph shows occurrence frequency with 
respect to change amount of a DCT coefficient value by 1 .5Mbps MPEG -1 coding. As shown in Fig. 30. there is a case 
in which a considerable amount of watermarked data appears in the vicinity of the boundary of the reading bit value 
(which is shown in two dotted circles a and a'). As a result, it becomes difficult to separate noise from the watermarked 
data. In addition, there is a possibility that a digital watermark data value which is read becomes reversed with respect 
to the embedded digital watermark data. 

[0202] In order to avoid the problem, two measures are conceivable. First measure is to raise the data diffusion rate 
by increasing the number of times the data is embedded. Second measure is to increase the watermark embedding 
strength. Neither of these measures is a true solution because the first one may reduce the relative amount of embedded 
data and the second one may degrade the image. Accordingly, the present invention adopts soft decision rather than 
hard decision. In the following, a general description of the present invention will be given. 

[0203] Fig. 31 is a diagram of a principle of the present invention corresponding to the third object. As shown in the 
figure, a watermark sequence and the reliability is obtained in step 1. and, then, most likely digital watermark data is 
reconstituted based on the watermark sequence and the reliability in step 2. 

[0204] Fig. 32 is a diagram showing a principle of a digital watermark reading apparatus of the present invention. As 
shown in Fig. 32. the digital watermark reading apparatus includes a digital watermark sequence obtaining part 1 and 
reconstitution part 2. The digital watermark sequence obtaining part 1 obtains the most likely digital watermark se- 
quence and the reliability by carrying out soft decisions of coding theory using a weight function, and the reconstitution 
part 2 reconstitutes digital watermark data on the basis of the most likely digital watermark sequence and the reliability. 
[0205] Inferring from the frequency plot shown in Fig. 30. it is easy to detect the digital watermark data sequence 
correctly if the repeating number of embedding is large enough. However, if a sufficiently large repeating number can 
not be obtained in actual practice, it becomes difficult to detect the desired digital watermark data sequence, thus 
filtering for watermarked content data becomes important. For example, concerning data which exists in the dotted 
circle in Fig. 30, it is difficult to determine whether the data is watermarked content data or noise. Thus, it is needed to 
separate watermarked content data from noise effectively. For that purpose, according to the present invention, weights 
are assigned to the digital watermark data sequence by using soft decisions of coding theory. Specifically, distribution 
of watermarked content data is predicted, then digital watermark data is reconstituted from a digital watermark data 
sequence to which a corresponding distribution value is added as the weight. 

[0206] Accordingly, the watermarked content data can be separated from noise. Thus, error bits included in the digital 
watermark data sequence can be reduced, thereby a success rate of reading digital watermark data improves as 
compared with the above-mentioned conventional methods. According to the present invention, it becomes possible 
to see significant distribution bias in the watermark content data when the repeating number of embedding digital 
watermark data is small. 

[0207] In the following, the present invention corresponding to the third object will be described in detail. 
[0208] First, the operation of the digital watermark reading apparatus 106 will be described. Fig. 33 is a diagram for 
explaining the operation. As shown in Fig. 33, the process according to the present invention corresponds to the process 
shown in Fig. 5 in which steps 240 - 250 are improved. 

[0209] As shown in Fig. 33, in the digital watermark reading apparatus 106, when reading digital watermark. 
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The function weight will be described later. 

[0210] In the process for reconstituting digital watermark data by performing statistical processing on a digital wa- 
10 termark data sequence, for example. 



20 is used for the reconstitution. 
(Thirteenth Embodiment) 

[0211] In the following, a thirteenth embodiment of the present invention will be described. In the following example. 
25 the digital watermark reading process based on quantization in the digital watermark reading apparatus 106 will be 
described. 

[0212] According to the thirteenth embodiment of the present invention, the digital watermark embedding process 
is not changed from the conventional method. On the other hand, the digital watermark reading process is modified in 
order to improve digital watermark reading performance. 

30 [0213] Here, let digital watermark data to be embedded in contents be w 0 ,w 1 ,-\w n _ 1 .Wj e{-1,1}, 0 ^i^n-1, and let 
a data set in which digital watermark data is embedded be {d 00 .d 01 .■•.d 0nv 1 .d 1<0 ,d 1a1 , ••.d 1 <m-1 d^ 
d n-i,m-i }• Let a quantization value used for quantize data djj (0^i^ n-1,0^j^m-1) be q^. Each bit data Wj of digital 
watermark data is embedded m times repeatedly. The digital watermark embedding process based on quantization is 
assumed to be a process in the following. 

35 [0214] For all i and j (0^i^n-1, 0<j^m-1) i) If 
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[0215] ii) If 
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25 [0216] iii) If 
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Here. i is a maximum number which does not exceed x. "x mod y" represents the remainder of x divided by y. 
[0217] The present invention is not only applicable to the contents in which digital watermark data is embedded in 
the above-mentioned way but also applicable to other contents in which digital watermark data is embedded in other 
equivalent way. 

[0218] In the following, the operation of the digital watermark reading apparatus 106 will be described in detail. 
[0219] According to a following process, a watermark sequence 



w 



is read 

from a set of data values 



75 



of the 

20 watermarked digital data contents 105 in which digital watermark data is embedded. 
[0220] For all i and j (0?:i^n-1. 0^jm-1) 
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[0221] Here, weight(x) (the domain is -j<x<^, and the range is equal to or more than 0. The function weight(x) will 
be called a weight function hereinafter) is a function which assigns weights to a read watermark sequence. By adopting 
a function which takes a large value in the vicinity of the central value (in the vicinity of the dotted vertical axis' in Fig. 
30) and takes a small value in the vicinity of the boundary of the bit value (in the dotted circle in Fig. 30). it becomes 
possible to separate effective watermark data sequence from noise. 

[0222] Of course, it is possible to adopt a stretched weight(x) function in which the domain and the region is not 
limited. However, in the case, it is necessary to change the above mentioned formula to some extent. 
[0223] Contents in which digital watermark data is embedded by digital watermark embedding processing is degraded 
due to data compression, media processing and the like. Thus, a watermark embedded data value-*" deviates in some 
degree from a value djj of immediately after being embedded. Therefore, it is desirable to adopt a following function 
as the weight function. The function can be obtained such that the distribution of the ratio 



50 



of the amount of the deviation between 6 X j and < to the quantization value qj j is predicted, and it is normalized with 
55 an appropriate scale for approximation. (There is no condition for the scale.) 

[0224] For example, when assuming that digital watermark data is read from watermarked motion pictures which 
are MPEG compressed, the distribution shown in Fig 30 can be approximated by a Laplacian distribution. Thus, a 
Laplacian distribution of average 0 and variance 0 08 m <\ normal distribution of average 0 and variance 1/16 can be 
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used effectively as the weighting function. 

[0225] In addition, there is another method which uses another distribution function. The distribution function is 
formed so as to predict the error of the watermarked content data. 

[0226] The digital watermark reading apparatus 106 reconstitutes and outputs digital watermark dataiv,, , vv n _. 

5 from read watermark sequence by applying, for example. 

• si::*- *° 

10 

- 1 y mM s>. < o 

or Japanese patent application No. 10-219236 ."Embedding data coding method, the apparatus, computer readable 
is medium storing embedding data coding program, read data decoding method, the apparatus, computer readable me- 
dium storing read data decoding program, digital watermark data coding method, the apparatus, computer readable 
medium storing digital watermark coding program, digital watermark decoding method, the apparatus, computer read- 
able medium storing digital watermark decoding program". 

[0227] In addition, the above-mentioned process performed by the digital watermark reading apparatus 106 can be 
20 constructed by a program which can be stored in a computer readable medium such as a disk unit, a floppy disk, CD- 
ROM and the like. That is, by installing the program in a computer such as one shown in Fig. 17. the processes of 
watermark reading of the present invention can be carried out. In addition, the digital watermark reading apparatus of 
the present invention can be realized by the integrated circuit shown in Fig. 18. 

[0228] Experiments is performed in order to compare the method of the present invention and the conventional 
25 method of digital watermarking to motion pictures described in Japanese patent application No.9-164466. 

[0229] As experimental conditions, a unit for digital watermark processing is assumed to be a 16 ^ 16 pixel and the 
conventional digital watermark data sequence reading is assumed to be".., = (n, y mod 2) 2-1 on the basis of the 

assumptions of the above-mentioned embodiment. Watermark data is reconstituted as . 0 . w i w, M for both of the 

present invention and the conventional method. 
30 [0230] As shown in Fig. 34. it is recognized that digital watermark data reading success rate is improved in an MPEG- 
1 coded picture in any bit rates. The result shows the effectiveness of the present invention. Here, the digital watermark 
data reading success rate is obtained by dividing the number of correctly reconstituted digital watermark data by the 
total number of embedded digital watermark data. 

[0231] According to the present invention, the digital watermark data sequence is separated from the noise so that 
35 error bits which are included in the digital watermark data sequence can be reduced, thereby the digital watermark 
data reading success rate is improved in comparison with the conventional method. 

[0232] In addition, since weights are assigned to the digital watermark data sequence, the present invention is es- 
pecially effective when the repeating number of watermark embedding is small. 

[0233] The point of the present invention corresponding to the third objective is applying soft decisions for the digital 
40 watermark reading process as opposed to the conventional method which uses hard decisions. The present invention 
is not limited to the above-mentioned process and can apply to other equivalent digital watermarking method. 
[0234] In the above-mentioned embodiments corresponding to first - third objects, embodiments corresponding to 
each object can be performed with embodiments corresponding to other objects. 

[0235] The present invention is not limited to the specifically disclosed embodiments, and variations and modifications 
^5 may be made without departing from the scope of the invention. 

Claims 

50 1. A method for embedding digital watermark data in digital data contents, 
characterized in that said method comprises the steps of: 

receiving said digital data contents and said digital watermark data: 
dividing said digital data contents into block data: 
55 obtaining a frequency coefficient of said block data: 

obtaining a complexity of said block data: 

obtaining an amount of transformation of sai'i frequency coefficient from said complexity and said digital wa- 
termark data by using a quantization width 
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embedding said digital watermark data in said digital data contents by transforming said frequency coefficient 
by said amount; and 

generating watermarked digital data contents. 

2. The method as claimed in claim 1 , said step of obtaining said complexity of said block data comprising the steps of: 

transforming said block data, by applying a wavelet transform, into coefficients of said wavelet transform, and 
obtaining said complexity on the basis of the number of high frequency coefficients in said coefficients of said 
wavelet transform, each of said high frequency coefficients exceeding a threshold. 

3. A method for embedding digital watermark data in digital data contents, 
characterized in that said method comprises the steps of: 

receiving said digital data contents and said digital watermark data; 
15 dividing said digital data contents into block data; 

obtaining a frequency coefficient of said block data; 

obtaining an amount of transformation of said frequency coefficient from said digital watermark data by using 
a quantization width corresponding to said frequency coefficient, said quantization width being obtained be- 
forehand according to a manipulation method of said digital data contents; 
20 embedding said digital watermark data in said digital data contents by transforming said frequency coefficient 

by said amount; and 

generating watermarked digital data contents. 

4. The method as claimed in claim 3, 

25 wherein said quantization width is obtained by a method comprising the steps of: 

dividing first digital data contents into one or a plurality of first block data; 

dividing second digital data contents into one or a plurality of second block data, said second digital data 
contents being obtained by manipulating said first digital data contents with a predetermined manipulation 
30 method; 

transforming said first block data and said second block data into first frequency coefficients and second fre- 
quency coefficients respectively by applying an orthogonal transform; 

obtaining difference values between said first frequency coefficients and said second frequency coefficients 
for each frequency coefficient; 
35 calculating a standard deviation of distribution of said difference values; and 

obtaining said quantization width by multiplying said standard deviation by a watermark embedding strength. 

5. A method for reading digital watermark data embedded in digital data contents, 
characterized in that said method comprises the steps of: 

receiving said digital data contents; 

dividing said digital data contents into block data; 

obtaining a frequency coefficient of said block data; and 

generating digital watermark data from said frequency coefficient by using a quantization width corresponding 
45 to said frequency coefficient, said quantization width being obtained beforehand according to a manipulation 

method of said digital data contents. 

6. The method as claimed in claim 5. 

wherein said quantization width is obtained by a method comprising the steps of: 

so 

dividing first digital data contents into one or a plurality of first block data; 

dividing second digital data contents into one or a plurality of second block data, said second digital data 
contents being obtained by manipulating said first digital data contents with a predetermined manipulation 
method; 

55 transforming said first block data and said second biock data into first frequency coefficients and second fre- 

quency coefficients respectively by applying an orthogonal transform; 

obtaining difference values between said first frequency coefficients and said second frequency coefficients 
for each frequency coefficient; 
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calculating a standard deviation of distribution of said difference values: and 

obtaining said quantization width by multiplying said standard deviation by a watermark embedding strength. 

7. An apparatus for embedding digital watermark data in digital data contents. 
5 characterized in that said apparatus comprises: 

means for receiving said digital data contents and said digital watermark data: 
means for dividing said digital data contents into block data: 
means for obtaining a frequency coefficient of said block data; 
w means for obtaining a complexity of said block data; 

means for obtaining an amount of transformation of said frequency coefficient from said complexity and said 
digital watermark data by using a quantization width; 

means for embedding said digital watermark data in said digital data contents by transforming said frequency 
coefficient by said amount; and 
?5 means for generating watermarked digital data contents. 

8. The apparatus as claimed in claim 7. 

said means for obtaining said complexity of said block data comprising: 

20 means for transforming said block data, by applying a wavelet transform, into coefficients of said wavelet 

transform, and 

means for obtaining said complexity on the basis of the number of high frequency coefficients in said coeffi- 
cients of said wavelet transform, each of said high frequency coefficients exceeding a threshold. 

25 9. An apparatus for embedding digital watermark data in digital data contents, 
characterized in that said apparatus comprises: 

means for receiving said digital data contents and said digital watermark data; 
• means for dividing said digital data contents into block data; 
30 means for obtaining a frequency coefficient of said block data; 

means for obtaining an amount of transformation of said frequency coefficient from said digital watermark data 
by using a quantization width corresponding to said frequency coefficient, said quantization width being ob- 
tained beforehand according to a manipulation method of said digital data contents; 

means for embedding said digital watermark data in said digital data contents by transforming said frequency 
35 coefficient by said amount; and 

means for generating watermarked digital data contents. 

10. The apparatus as claimed in claim 9. 

wherein said quantization width is obtained by means comprising: 

40 

means for dividing first digital data contents into one or a plurality of first block data; 

means for dividing second digital data contents into one or a plurality of second block data, said second digital 
data contents being obtained by manipulating said first digital data contents with a predetermined manipulation 
method; 

45 means for transforming said first block data and said second block data into first frequency coefficients and 

second frequency coefficients respectively by applying an orthogonal transform; 

means for obtaining difference values between said first frequency coefficients and said second frequency 
coefficients for each frequency coefficient: 

means for calculating a standard deviation of distribution of said difference values; and 
50 means for obtaining said quantization width by multiplying said standard deviation by a watermark embedding 

strength. 

11. An apparatus for reading digital watermark data embedded in digital data contents, 
characterized in that said apparatus comprises: 

55 

means for receiving said digital data contents: 

means for dividing said digital data contents into block data: 

means for obtaining a frequency coefficient of said block data; and 
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means for generating digital watermark data from said frequency coefficient by using a quantization width 
corresponding to said frequency coefficient, said quantization width being obtained beforehand according to 
a manipulation method of said digital data contents. 

5 12. The apparatus as claimed in claim 11 , 

wherein said quantization width is obtained by means comprising: 

means for dividing first digital data contents into one or a plurality of first block data; 

means for dividing second digital data contents into one or a plurality of second block data, said second digital 
10 data contents being obtained by manipulating said first digital data contents with a predetermined manipulation 

method; 

means for transforming said first block data and said second block data into first frequency coefficients and 
second frequency coefficients respectively by applying an orthogonal transform; 

means for obtaining difference values between said first frequency coefficients and said second frequency 
'5 coefficients for each frequency coefficient: 

means for calculating a standard deviation of distribution of said difference values; and 

means for obtaining said quantization width by multiplying said standard deviation by a watermark embedding 
strength. 

20 13. An integrated circuit for embedding digital watermark data in digital data contents, characterized in that said inte- 
grated circuit comprises: 

means for receiving said digital data contents and said digital watermark data; 
means for dividing said digital data contents into block data; 
25 means for obtaining a frequency coefficient of said block data: 

means for obtaining a complexity of said block data; 

means for obtaining an amount of transformation of said frequency coefficient from said complexity and said 
digital watermark data by using a quantization width; 

means for embedding said digital watermark data in said digital data contents by transforming said frequency 
30 coefficient by said amount; and 

means for generating watermarked digital data contents. 

14. The integrated circuit as claimed in claim 13, said means for obtaining said complexity of said block data comprising: 

35 means for transforming said block data, by applying a wavelet transform, into coefficients of said wavelet 

transform, and 

means for obtaining said complexity on the basis of the number of high frequency coefficients in said coeffi- 
cients of said wavelet transform, each of said high frequency coefficients exceeding a threshold. 

40 1 5. An integrated circuit for embedding digital watermark data in digital data contents, characterized in that said inte- 
grated circuit comprises: 

means for receiving said digital data contents and said digital watermark data; 
means for dividing said digital data contents into block data; 
45 means for obtaining a frequency coefficient of said block data; 

means for obtaining an amount of transformation of said frequency coefficient from said digital watermark data 
by using a quantization width corresponding to said frequency coefficient, said quantization width being ob- 
tained beforehand according to a manipulation method of said digital data contents; 

means for embedding said digital watermark data in said digital data contents by transforming said frequency 
5 o coefficient by said amount; and 

means for generating watermarked digital data contents. 

16. The integrated circuit as claimed in claim 15. wherein said quantization width is obtained by means comprising: 

55 means for dividing first digital data contents into one or a plurality of first block data; 

means for dividing second digital data contents into one or a plurality of second block data, said second digital 
data contents being obtained by manipulating said first digital data contents with a predetermined manipulation 
method; 
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means for transforming said first block data and said second block data into first frequency coefficients and 
second frequency coefficients respectively by applying an orthogonal transform; 

means for obtaining difference values between said first frequency coefficients and said second frequency 
coefficients for each frequency coefficient; 
5 means for calculating a standard deviation of distribution of said difference values; and 

means for obtaining said quantization width by multiplying said standard deviation by a watermark embedding 
strength. 

17. An integrated circuit for reading digital watermark data embedded in digital data contents, characterized in that 
w said integrated circuit comprises: 

means for receiving said digital data contents; 
means for dividing said digital data contents into block data; 
means for obtaining a frequency coefficient of said block data; and 
?5 means for generating digital watermark data from said frequency coefficient by using a quantization width 

corresponding to said frequency coefficient, said quantization width being obtained beforehand according to 
a manipulation method of said digital data contents. 

18. The integrated circuit as claimed in claim 17. wherein said quantization width is obtained by means comprising: 

20 

means for dividing first digital data contents into one or a plurality of first block data; 

means for dividing second digital data contents into one or a plurality of second block data, said second digital 
data contents being obtained by manipulating said first digital data contents with a predetermined manipulation 
method; 

25 means for transforming said first block data and said second block data into first frequency coefficients and 

second frequency coefficients respectively by applying an orthogonal transform; 

means for obtaining difference values between said first frequency coefficients and said second frequency 
coefficients for each frequency coefficient; 

means for calculating a standard deviation of distribution of said difference values; and 
30 means for obtaining said quantization width by multiplying said standard deviation by a watermark embedding 

strength. 

19. A computer readable medium storing program code for causing a computer system to embed digital watermark 
data in digital data contents, characterized in that said computer readable medium comprises: 

35 

program code means for receiving said digital data contents and said digital watermark data; 
program code means for dividing said digital data contents into block data; 
program code means for obtaining a frequency coefficient of said block data; 
program code means for obtaining a complexity of said block data; 
-to program code means for obtaining an amount of transformation of said frequency coefficient from said com- 

plexity and said digital watermark data by using a quantization width; 

program code means for embedding said digital watermark data in said digital data contents by transforming 
said frequency coefficient by said amount; and 

program code means for generating watermarked digital data contents. 

-45 

20. The computer readable medium as claimed in claim 19. said program code means for obtaining said complexity 
of said block data comprising: 

program code means for transforming said block data, by applying a wavelet transform, into coefficients of 
50 said wavelet transform, and 

program code means for obtaining said complexity on the basis of the number of high frequency coefficients 
in said coefficients of said wavelet transform, each of said high frequency coefficients exceeding a threshold. 

21. A computer readable medium storing program code for causing a computer system to embed digital watermark 
55 data in digital data contents, characterized in that said computer readable medium comprises: 

program code means for receiving said digital data contents and said digital watermark data; 
program code means for dividing said digital data contents into block data; 
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program code means for obtaining a frequency coefficient of said block data; 

program code means for obtaining an amount of transformation of said frequency coefficient from said digital 
watermark data by using a quantization width corresponding to said frequency coefficient, said quantization 
width being obtained beforehand according to a manipulation method of said digital data contents; 
program code means for embe'dding said digital watermark data in said digital data contents by transforming 
said frequency coefficient by said amount; and 

program code means for generating watermarked digital data contents. 

22. The computer readable medium as claimed in claim 21. wherein said quantization width is obtained by program 
code means comprising: 

program code means for dividing first digital data contents into one or a plurality of first block data; 
program code means for dividing second digital data contents into one or a plurality of second block data, said 
second digital data contents being obtained by manipulating said first digital data contents with a predetermined 
manipulation method; 

program code means for transforming said first block data and said second block data into first frequency 
coefficients and second frequency coefficients respectively by applying an orthogonal transform; 
program code means for obtaining difference values between said first frequency coefficients and said second 
frequency coefficients for each frequency coefficient; 

program code means for calculating a standard deviation of distribution of said difference values: and 
program code means for obtaining said quantization width by multiplying said standard deviation by a water- 
mark embedding strength. 

23. A computer readable medium storing program code for causing a computer system to read digital watermark data 
embedded in digital data contents, characterized in that said computer readable medium comprises: 

program code means for receiving said digital data contents; 

program code means for dividing said digital data contents into block data; 

program code means for obtaining a frequency coefficient of said block data; and 

program code means for generating digital watermark data from said frequency coefficient by using a quan- 
tization width corresponding to said frequency coefficient, said quantization width being obtained beforehand 
according to a manipulation method of said digital data contents. 

24. The computer readable medium as claimed in claim 23, wherein said quantization width is obtained by program 
code means comprising: 

program code means for dividing first digital data contents into one or a plurality of first block data; 
program code means for dividing second digital data contents into one or a plurality of second block data, said 
second digital data contents being obtained by manipulating said first digital data contents with a predetermined 
manipulation method; 

program code means for transforming said first block data and said second block data into first frequency 
coefficients and second frequency coefficients respectively by applying an orthogonal transform; 
program code means for obtaining difference values between said first frequency coefficients and said second 
frequency coefficients for each frequency coefficient; 

program code means for calculating a standard deviation of distribution of said difference values: and 
program code means for obtaining said quantization width by multiplying said standard deviation by a water- 
mark embedding strength. 

25. A method for reading digital watermark data embedded in digital data contents, 
characterized in that said method comprises the steps of: 

receiving said digital data contents; 

reading a bit sequence from said digital data contents; 

calculating a probability of reading a bit "V or a bit '0' in said bit sequence by using a test method on the basis 
of binary distribution; 

determining the presence or absence of digital watermark data according to said probability; and 
reconstituting and generating said digital watermark data from said bit sequence. 
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26. The method as claimed in claim 25. further comprising the steps of: 



determining threshold u of reliability of digital watermark data which is read; 

obtaining a binary distribution function F(x) which represents a probability that a number x of T bits or '0' bits 
5 are included in a bit sequence which is read at random from digital data contents, said binary distribution 

function F(x) being obtained by using a probability q of reading '1' or '0* in said bit sequence and a repeating 
number of embedding each bit of digital watermark data; 

reading an ith digital watermark sequence of said digital watermark data from a digital watermark area of said 
digital data contents; 

10 calculating the number k { of 'V or '0' included in said digital watermark sequence; 

calculating a probability F(kj) by using said binary distribution function F(x); and 

reconstituting '1' or '0' from ith digital watermark data Wj if F(k } ) > a., reconstituting '0' or '1' from ith digital 
watermark data w 4 if 1-F(kj) > (/., and determining that there is no watermark data or the presence is unknown 
if both of F(kj) > (/. and 1-F(kj) > a are not satisfied. 



27. The method as claimed in claim 26. further comprising the steps of; 

outputting F(kj) as reliability if said reconstituted digital watermark data w s is 'V; and 
outputting 1 -F(kj) as the reliability if said reconstituted digital watermark data is '0'. 

28. The method as claimed in claim 25. further comprising the steps of: 



determining a threshold ex of reliability of digital watermark data which is read; 

obtaining a binary distribution function F(x) which represents a probability that a number x of '1' bits or "0' bits 
25 are included in a bit sequence which is read at random from digital data contents, said binary distribution 

function F(x) being obtained by using a probability q of reading T or '0' in said bit sequence and a repeating 
number of embedding each bit of digital watermark data; 

reading an ith digital watermark sequence of said digital watermark data from a digital watermark area of said 
digital data contents; 

30 checking whether a probability that said digital watermark sequence is digital watermark data exceeds said 

threshold a by using said binary distribution function F(x); and 

reconstituting digital watermarkdata from said digital watermark sequence by using majority decision process- 
ing if said probability exceeds u. and determining that there is no watermark data or the presence is unknown 
if said probability does not exceed a. 

35 

29. The method as claimed in claim 28. further comprising a step of outputting said probability that said digital water- 
mark sequence is digital watermark data. 

30. The method as claimed in claim 25, if a data sequence which is embedded as said digital watermark data is 
to modulated by a pseudo-random sequence, said method further comprising the steps of: 

demodulating said bit sequence by said pseudo-random sequence; and 
reconstituting digital watermark data from said demodulated bit sequence. 

45 31. The method as claimed in claim 25. if a data sequence which is embedded as said digital watermark data is 
modulated by a pseudo-random sequence, said method further comprising the steps of: 

determining a threshold a of reliability of digital watermark data which is read; 

obtaining a binary distribution function F(x) which represents a probability that a number of x of T bits or '0* 
50 bits are included in a bit sequence which is read at random from digital data contents, said binary distribution 

function F(x) being obtained by using a probability q of reading T or "0" in said bit sequence and a repeating 
number of embedding each bit of digital watermark data; 

reading an ith digital watermark sequence of said digital watermark data from a digital watermark area of said 
digital data contents; 

55 demodulating said digital watermark sequence by said pseudo-random sequence: 

assigning 1/2 to said probability q; 

obtaining a maximum number x 0 which satisfies 0 - " F(x=x 0 ) - ~1 -</. and a minimum number x n which satisfies 
a<F(x=x l )^1; 
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obtaining the number kj of 'V or '0' included in said ith digital watermark sequence; and 

reconstituting ith digital watermark data w { as '0' or '1' if k^x^ and reconstituting said ith digital watermark 
data Wj as '1' or "0' if kj^x v 

5 32. The method as claimed in claim 25. if a data sequence which is embedded as said digital watermark data is 
modulated, by a pseudo-random sequence, said method further comprising the steps of: 

determining a threshold a of reliability of digital watermark data which is read; 

obtaining a binary distribution function F(x) which represents a probability that x of "V bits or'0' bits are included 
in a bit sequence which is read at random from digital data contents, said binary distribution function F(x) being 
obtained by using a probability q of reading '1 ' or'0' in said bit sequence and a repeating number t of embedding 
each bit of digital watermark data: 

reading an ith digital watermark sequence of said digital watermark data from a digital watermark area of said 
digital data contents; 

demodulating said digital watermark sequence by said pseudo-random sequence: 
assigning 1/2 to said probability q; 

obtaining x 0 or x 1 which satisfies 0< F(x=x 0 ) < 1-(/. or uf5 F(x=x 1 ) 5 1 ; 

determining whether a value is equal to or less than Xq or equal to or more than x 1( said value being a mean 
value of absolute values of a difference between the number of '0' or 'V included in said ith digital watermark 
sequence and a central value qXt of a binary distribution; 

reconstituting digital watermark data by performing majority decision processing for said ith digital watermark 
sequence if said value is equal to or less than Xq or equal to or more than x v and 

determining that there is no digital watermark data or the presence is unknown if said value is not equal to or 
less than x 0 or equal to or more than x v 

33. The method as claimed in claim 32. further comprising the steps of: 

calculating a value of said binary distribution function F(z). z being said mean value obtained from the number 
of '0' or "V included in said ith digital watermark sequence and said central value q^t; and 
30 outputting said value of F(z) as reliability of digital watermark data. 

34. An apparatus for reading digital watermark data embedded in digital data contents, characterized in that said 
apparatus comprises: 

means for receiving said digital data contents; 
means for reading a bit sequence from said digital data contents; 

means for calculating a probability of reading a bit Tora bit '0' in said bit sequence by using a test method 
on the basis of binary distribution; 

means for determining the presence or absence of digital watermark data according to said probability; and 
means for reconstituting said digital watermark data from said bit sequence. 

35. The apparatus as claimed in claim 34, further comprising: 

means for obtaining a binary distribution function F(x) which represents a probability that a number x of '1 ' bits 
->5 or '0' bits are included in a bit sequence which is read at random from digital data contents, said binary distri- 

bution function F(x) being obtained by using a probability q of reading T or '0' in said bit sequence and a 
repeating number of embedding each bit of digital watermark data; 

means for reading an ith digital watermark sequence of said digital watermark data from a digital watermark 
area of said digital data contents: 

50 means for calculating the number kj of 'V or '0' included in said digital watermark sequence; 

means for calculating a probability F(kj) by using said binary distribution function F(x); and 
means for reconstituting '1' or '0" from ith digital watermark data Wj if F(kj) > </.. reconstituting '0' or 'V from ith 
digital watermark data Wj if 1 -F( k ; ) > </.. and. determining that there is no watermark data or the presence is 
unknown if both of F(kj) > a and 1 -F(k s ) > </. are not satisfied . u being a threshold of reliability of digital watermark 

55 data which is read. 

36. The apparatus as claimed in claim 35. further comprising: 
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means for outputting F(kj) as reliability if said reconstituted digital watermark data Wj is T; and 
means for outputting 1 -F(kj) as reliability if said reconstituted digital watermark data Wj is '0'. 

37. The apparatus as claimed in claim 34. further comprising: 

means, for obtaining a binary distribution function F(x) which represents a probability that a number x of 'V bits 
or '0' bits are included in a bit sequence which is read at random from digital data contents, said binary distri- 
bution function F(x) being obtained by using a probability q of reading '1' or *0' in said bit sequence and a 
repeating number of embedding each bit of digital watermark data; 

means for reading an ith digital watermark sequence of said digital watermark data from a digital watermark 
area of said digital data contents; 

means for checking whether a probability that said digital watermark sequence is digital watermark data ex- 
ceeds said threshold a by using said binary distribution function F(x). (/. being a threshold of reliability of digital 
watermark data which is read; and 

means for reconstituting and generating digital watermark data from said digital watermark sequence by using 
majority decision processing if said probability exceeds (/., and. determining that there is no watermark data 
or the presence is unknown if said probability does not exceed a. 

38. The apparatus as claimed in claim 37, further comprising means for outputting said probability that said digital 
20 watermark sequence is digital watermark data. 

39. The apparatus as claimed in claim 34. if a data sequence which is embedded as said digital watermark data is 
modulated by a pseudo-random sequence, said apparatus further comprising: 

25 means for demodulating said bit sequence by said pseudo-random sequence; and 

means for reconstituting digital watermark data from said demodulated bit sequence. 

40. The apparatus as claimed in claim 34, if a data sequence which is embedded as said digital watermark data is 
modulated by a pseudo-random sequence, said apparatus further comprising: 

means for obtaining a binary distribution function F(x) which represents a probability that a number x of 'V bits 
or "0' bits are included in a bit sequence which is read at random from digital data contents, said binary distri- 
bution function F(x) being obtained by using a probability q of reading '1' or '0' in said bit sequence and a 
repeating number of embedding each bit of digital watermark data; 

means for reading an ith digital watermark sequence of said digital watermark data from a digital watermark 
area of said digital data contents; 

means for demodulating said digital watermark sequence by said pseudo-random sequence; 
means for assigning 1/2 to said probability q; 

means for obtaining a maximum number Xq which satisfies 0^F(x=x o )^ 1-a and a minimum number x^ which 
satisfies a^F(x=x 1 )^1 ,a being a threshold of reliability of digital watermark data which is read; 
means for obtaining the number kj of '1' or '0' included in said ith digital watermark sequence; and 
means for reconstituting ith digital watermark data w s as '0' or T if kj<x 0 . and, reconstituting said ith digital 
watermark data Wj as '1* or '0' if kj^x v 

45 41. The apparatus as claimed in claim 34, if a data sequence which is embedded as said digital watermark data is 
modulated by a pseudo-random sequence, said apparatus further comprising: 

means for obtaining a binary distribution function F(x) which represents a probability that a number x of T bits 
or '0' bits are included in a bit sequence which is read at random from digital data contents, said binary distri- 
50 bution function F(x) being obtained by using a probability q of reading 'V or '0' in said bit sequence and a 

repeating number t of embedding each bit of digital watermark data; 

means for reading an ith digital watermark sequence of said digital watermark data from a digital watermark 
area of said digital data contents; 

means for demodulating said digital watermark sequence by said pseudo-random sequence; means for as- 
55 signing 1/2 to said probability q: 

means for obtaining x 0 or which satisfies 0- F(x=x 0 )- 1-u or (/.<F(x=x 1 )?1 .« being a threshold of reliability 
of digital watermark data which is read; 

means for determining whether a value is equal to or less than Xq or equal to or more than x v said value being 
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a mean value of absolute values of a difference between the number of '0' or 'V included in said ith digital 
watermark sequence and a central value q^t of a binary distribution; 

means for reconstituting digital watermark data by performing majority decision processing for said ith digital 
watermark sequence if said value is equal to or less than Xq or equal to or more than and 
means for determining that there is no digital watermark data or the presence is unknown if said value is not 
equal to or less than x 0 or equal to or more than x v 

The apparatus as claimed in claim 41. further comprising: 

means for calculating a value of said binary distribution function F(z), z being said mean value obtained from 
the number of '0' or '1' included in said ith digital watermark sequence and said central value q^.t: and 
means for outputting said value of F(z) as reliability of digital watermark data. 

An integrated circuit for reading digital watermark data embedded in digital data contents, characterized in that 
said integrated circuit comprises: 

means for receiving said digital data contents: 

means for reading a bit sequence from said digital data contents; 

means for calculating a probability of reading a bit 'V or a bit '0' in said bit sequence by using a test method 
on the basis of binary distribution; 

means for determining the presence or absence of digital watermark data according to said probability; and 
means for reconstituting and generating said digital watermark data from said bit sequence. 

The integrated circuit as claimed in claim 43. further comprising: 

means for obtaining a binary distribution function F(x) which represents a probability that a number x of '1 ' bits 
or '0' bits are included in a bit sequence which is read at random from digital data contents, said binary distri- 
bution function F(x) being obtained by using a probability q of reading '1' or '0" in said bit sequence and a 
repeating number of embedding each bit of digital watermark data; 

means for reading an ith digital watermark sequence of said digital watermark data from a digital watermark 
area of said digital data contents: 

means for calculating the number k s of '1' or '0' included in said digital watermark sequence; 

means for calculating a probability F(kj) by using said binary distribution function F(x); and 

means for reconstituting '1' or '0' from ith digital watermark data Wj if F(kj) > a. reconstituting '0' or T from ith 

digital watermark data Wj if 1 -F(k i ) > a, and determining that there is no watermark data or the presence is 

unknown if both of F(kj) > (/.and 1 -F(kj) > a. are not satisfied, u being a threshold of reliability of digital watermark 

data which is read. 

The integrated circuit as claimed in claim 44. further comprising: 

means for outputting F(kj) as reliability if said reconstituted digital watermark data Wj is '1'; and 
means for outputting 1 -F( kj) as reliability if said reconstituted digital watermark data w s is '0'. 

The integrated circuit as claimed in claim 43. further comprising: 

means for obtaining a binary distribution function F(x) which represents a probability that a number of x of 'V 
bits or '0' bits are included in a bit sequence which is read at random from digital data contents, said binary 
distribution function F(x) being obtained by using a probability q of reading '1' or '0' in said bit sequence and 
a repeating number of embedding each bit of digital watermark data; 

means for reading an ith digital watermark sequence of said digital watermark data from a digital watermark 
area of said digital data contents; 

means for checking whether a probability that said digital watermark sequence is digital watermark data ex- 
ceeds said threshold (/. by using said binary distribution function F(x). a. being a threshold of reliability of digital 
watermark data which is read; and 

means for reconstituting and generating digital watermark data from said digital watermark sequence by using 
majority decision processing if said probability exceeds u. and. determining that there is no watermark data 
or the presence is unknown if said probability ^ioes not exceed (/.. 
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47. The integrated circuit as claimed in claim 46. further comprising means for outputting said probability that said ' 
digital watermark sequence is digital watermark data. 

48. The integrated circuit as claimed in claim 43, if a data sequence which is embedded as said digital watermark data 
is modulated by a pseudo-random sequence, said integrated circuit further comprising: 

means for demodulating said bit sequence by said pseudo-random sequence; and 
means for reconstituting digital watermark data from said demodulated bit sequence. 

49. The integrated circuit as claimed in claim 43. if a data sequence which is embedded as said digital watermark data 
is modulated by a pseudo-random sequence, said integrated circuit further comprising: 

means for obtaining a binary distribution function F(x) which represents a probability that a number x of '1' bits 
or '0' bits are included in a bit sequence which is read at random from digital data contents, said binary distri- 
bution function F(x) being obtained by using a probability q of reading '1' or '0* in said bit sequence and a 
repeating number of embedding each bit of digital watermark data: 

means for reading an ith digital watermark sequence of said digital watermark data from a digital watermark 
area of said digital data contents: 

means for demodulating said digital watermark sequence by said pseudo-random sequence; means for as- 
signing 1/2 to said probability q; 

means for obtaining a maximum number Xq which satisfies 0--- : F(x=x 0 ) < 1-(/. and a minimum number x-, which 
satisfies u^F(x=x 1 )^ 1 ,u being a threshold of reliability of digital watermark data which is read: and 
means for obtaining the number kj of '1* or '0' included in said ith digital watermark sequence; 
means for reconstituting ith digital watermark data w 4 as '0' or T if kj-^XQ, and, reconstituting said ith digital 
watermark data Wj as '1* or '0* if kj- = "X 1 . 

50. The integrated circuit as claimed in claim 43, if a data sequence which is embedded as said digital watermark data 
is modulated by a pseudo-random sequence, said integrated circuit further comprising: 

means for obtaining a binary distribution function F(x) which represents a probability that a number x of ' 1 ' bits 
or '0' bits are included in a bit sequence which is read at random from digital data contents, said binary distri- 
bution function F(x) being obtained by using a probability q of reading 'V or '0' in said bit sequence and a 
repeating number t of embedding each bit of digital watermark data; 

means for reading an ith digital watermark sequence of said digital watermark data from a digital watermark 
area of said digital data contents; 

means for demodulating said digital watermark sequence by said pseudo-random sequence; 
means for assigning 1/2 to said probability q; 

means for obtaining x 0 or x., which satisfies 0 <F(x=x 0 )^ 1-u or (/.■ I >F(x=x 1 ): c f 1 ,a. being a threshold of reliability 
of digital watermark data which is read; 

means for determining whether a value is equal to or less than Xq or equal to or more than x v said value being 
a mean value of absolute values of a difference between the number of '0' or 'V included in said ith digital 
watermark sequence and a central value qXt of a binary distribution; 

means for reconstituting digital watermark data by performing majority decision processing for said ith digital 
watermark sequence if said value is equal to or less than Xq or equal to or more than x-,; and 
means for determining that there is no digital watermark data or the presence is unknown if said value is not 
equal to or less than x 0 or equal to or more than x v 

51. The integrated circuit as claimed in claim 50. further comprising: 

means for calculating a value of said binary distribution function F(z). z being said mean value obtained from 
the number of '0' or 'V included in said ith digital watermark sequence and said central value q^t; and 
means for outputting said value of F(z) as reliability of digital watermark data. 

52. A computer readable medium storing program code for causing a computer system to read digital watermark data 
embedded in digital data contents, characterized in (hat said computer readable medium comprises: 

program code means for receiving said digital -'Jala contents; 

program code means for reading a bit sequ-no-r from said digital data contents; 
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program code means for calculating a probability of reading a bit 'V or a bit '0' in said bit sequence by using 
a test method on the basis of binary distribution; 

program code means for determining the presence or absence of digital watermark data according to said 
probability; and 

5 program code means for reconstituting and generating said digital watermark data from said bit sequence. 

53. The computer readable medium as claimed in claim 52. further comprising: 

program code means for obtaining a binary distribution function F(x) which represents a probability that a 
io number x of '1' bits or '0' bits are included in a bit sequence which is read at random from digital data contents, 

said binary distribution function F(x) being obtained by using a probability q of reading T or '0* in said bit 
sequence and a repeating number of embedding each bit of digital watermark data; 

program code means for reading an ith digital watermark sequence of said digital watermark data from a digital 
watermark area of said digital data contents; 

f5 program code means for calculating the number of ' 1 ' or '0' included in said digital watermark sequence; and 

program code means for calculating a probability F(kj) by using said binary distribution function F(x); 
program code means for reconstituting 'V or '0' from ith digital watermark data w { if F(k 4 ) > (/., reconstituting '0' 
or T from ith digital watermark data Wj if 1-F(kj) > (/.. and. determining that there is no watermark data or the 
presence is unknown if both of F(k,) > u and 1 -F(kj) > (/. are not satisfied, a being a threshold of reliability of 

20 digital watermark data which is read. 

54. The computer readable medium as claimed in claim 53. further comprising: 

program code means for outputting F(kj) as reliability if said reconstituted digital watermark data w { is "V; and 

25 

program code means for outputting 1-F(kj) as reliability if said reconstituted digital watermark data w s is '0'. 

55. The computer readable medium as claimed in claim 52. further comprising: 

30 program code means for obtaining a binary distribution function F(x) which represents a probability that a 

number x of '1 ' bits or '0' bits are included in a bit sequence which is read at random from digital data contents, 
said binary distribution function F(x) being obtained by using a probability q of reading T or '0' in said bit 
sequence and a repeating number of embedding each bit of digital watermark data; 

program code means for reading an ith digital watermark sequence of said digital watermark data from a digital 
35 watermark area of said digital data contents: 

program code means for checking whether a probability that said digital watermark sequence is digital water- 
mark data exceeds said threshold u by using said binary distribution function F(x). a. being a threshold of 
reliability of digital watermark data which is read: and 

program code means for reconstituting and generating digital watermark data from said digital watermark 
JO sequence by using majority decision processing if said probability exceeds a. and determining that there is 

no watermark data or the presence is unknown if said probability does not exceed a. 

56. The computer readable medium as claimed in claim 55. further comprising program code means for outputting 
said probability that said digital watermark sequence is digital watermark data as reliability of said reconstituted 

-to digital watermark data. 

57. The computer readable medium as claimed in claim 52. if a data sequence which is embedded as said digital 
watermark data is modulated by a pseudo-random sequence, said computer readable medium further comprising: 

50 program code means for demodulating said bit sequence by said pseudo-random sequence; and 

program code means for reconstituting digital watermark data from said demodulated bit sequence. 

58. The computer readable medium as claimed in claim 52. if data sequence which is embedded as said digital wa- 
termark data is modulated by a pseudo-random sequence, said computer readable medium further comprising: 

55 

program code means for obtaining a binary distribution function F(x) which represents a probability that a 
number x of '1 ' bits or '0' bits are included in a bit sequence which is read at random from digital data contents, 
said binary distribution function F(x) being obtained by using a probability q of reading 'V or '0* in said bit 
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sequence and a repeating number of embedding each bit of digital watermark data; 

program code means for reading an ith digital watermark sequence of said digital watermark data from a digital 
watermark area of said digital data contents; 

program code means for demodulating said digital watermark sequence by said pseudo-random sequence; 
5 program code means for assigning 1/2 to said probability q; 

program code means for obtaining a maximum number Xq which satisfies 0^F(x=x 0 )-^1-u and a minimum 
number x-, which satisfies a^F(x=x 1 )$1, a being a threshold of reliability of digital watermark data which is 
read; 

program code means for obtaining the number k^ of 'V or '0' included in said ith digital watermark sequence; and 
w program code means for reconstituting ith digital watermark data w ( as '0' or'V if kj55x 0 , and reconstituting said 

ith digital watermark data w f as '1' or '0' if k^x^ 

59. The computer readable medium as claimed in claim 52, if a data sequence which is embedded as said digital 
watermark data is modulated by a pseudo-random sequence, said computer readable medium further comprising: 

15 

program code means for obtaining a binary distribution function F(x) which represents a probability that a 
number x of '1' bits or '0' bits are included in a bit sequence which is read at random from digital data contents, 
said binary distribution function F(x) being obtained by using a probability q of reading 'V or '0' in said bit 
sequence and a repeating number t of embedding each bit of digital watermark data; 
20 program code means for reading an ith digital watermark sequence of said digital watermark data from a digital 

watermark area of said digital data contents: 

program code means for demodulating said digital watermark sequence by said pseudo-random sequence; 
program code means for assigning 1/2 to said probability q; 

program code means forobtaining XqCM-x, which satisfies 0 < rF(x=x o )^ 1 ■uor«s*F(x=x 1 )? 1 .(/.being a threshold 
25 of reliability of digital watermark data which is read; 

program code means for determining whether a value is equal to or less than Xq or equal to or more than x v 
said value being a mean value of absolute values of a difference between the number of '0" or T included in 
said ith digital watermark sequence and a centra! value q^t of a binary distribution; 

program code means for reconstituting digital watermark data by performing majority decision processing for 
30 said ith digital watermark sequence if said value is equal to or less than Xq or equal to or more than x } ; and 

program code means for determining that there is no digital watermark data or the presence is unknown if 
said value is not equal to or less than x 0 or equal to or more than x v 

60. The computer readable medium as claimed in claim 59. further comprising: 

35 

program code means for calculating a value of said binary distribution function F(z). z being said mean value 
obtained from the number of '0' or T included in said ith digital watermark sequence and said central value 
q^t; and 

program code means for outputting said value of F(z) as reliability of digital watermark data. 

AO 

61 . A method for reading digital watermark data from digital data contents in which each bit of digital watermark data 
is embedded a plurality of times, characterized in that said method comprises the steps of: 

receiving digital data contents; 
45 reading a digital watermark sequence from said digital data contents; 

performing soft decision in code theory by assigning weights to said digital watermark sequence with a weight- 
ing function; and 

reconstituting and generating digital watermark data from said digital watermark sequence. 

50 62. The method as claimed in claim 61 . wherein said weighting function is a distribution function obtained by a method 
comprising the steps of: 

dividing first digital data contents into one or a plurality of first block data: 

dividing second digital data contents into one or a plurality of second block data, said second digital data 
55 contents being obtained by manipulating said first digital data contents with a predetermined manipulation 

method; 

transforming said first block data and said second block data into first frequency coefficients and second fre- 
quency coefficients respectively by applying an orthogonal transform; and 
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obtaining a distribution of difference values between said first frequency coefficients and said second frequency 
coefficients, said distribution function being an approximation of said distribution, 

wherein said weights are assigned to said digital watermark sequence according to values of said distribution 
function. 

5 

63. The method as claimed in claim 61 , wherein said weighting function is a distribution function obtained by a method 
comprising the steps of: 

dividing first digital data contents into one or a plurality of first block data; 
to dividing second digital data contents into one or a plurality of second block data, said second digital data 

contents being obtained by manipulating said first digital data contents with a predetermined manipulation 
method; 

transforming said first block data and said second block data into first frequency coefficients and second fre- 
quency coefficients respectively by applying an orthogonal transform; and 
15 obtaining said distribution function on the basis of a theory if a distribution of difference values between said 

first frequency coefficients and said second frequency coefficients can be obtained by said theory, 
wherein said weights are assigned to said digital watermark sequence according to values of said distribution 
function. 

20 64. An apparatus for reading digital watermark data from digital data contents in which each bit of digital watermark 
data is embedded a plurality of times, characterized in that said apparatus comprises: 

means for receiving digital data contents; 

means for reading a digital watermark sequence from said digital data contents; 
25 means for performing soft decision in code theory by assigning weights to said digital watermark sequence 

with a weighting function; and 

means for reconstituting and generating digital watermark data from said digital watermark sequence. 

65. The apparatus as claimed in claim 64, wherein said weighting function is a distribution function obtained by means 
^0 comprising: 

means for dividing first digital data contents into one or a plurality of first block data; 

means for dividing second digital data contents into one or a plurality of second block data, said second digital 
data contents being obtained by manipulating said first digital data contents with a predetermined manipulation 
35 method; 

means for transforming said first block data and said second block data into first frequency coefficients and 
second frequency coefficients respectively by applying an orthogonal transform; and 

means for obtaining a distribution of difference values between said first frequency coefficients and said second 
frequency coefficients, said distribution function being an approximation of said distribution, 
JO wherein said weights are assigned to said digital watermark sequence according to values of said distribution 

function. 

66. The apparatus as claimed in claim 64, wherein said weighting function is a distribution function obtained by means 
comprising: 

means for dividing first digital data contents into one or a plurality of first block data; 

means for dividing second digital data contents into one or a plurality of second block data, said second digital 
data contents being obtained by manipulating said first digital data contents with a predetermined manipulation 
method; 

50 means for transforming said first block data and said second block data into first frequency coefficients and 

second frequency coefficients respectively by applying an orthogonal transform ; 

means for obtaining said distribution function on the basis of a theory if a distribution of difference values 
between said first frequency coefficients and said second frequency coefficients can be obtained by said theory 
and 

55 wherein said weights are assigned to said digital watermark sequence according to values of said distribution 

function. 

67. An integrated circuit for reading digital watermark data from digital data contents in which each bit of digital wa- 
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termark data is embedded a plurality of times, characterized in that said integrated circuit comprises: 
means for receiving digital data contents: 

means for reading a digital watermark sequence from said digital data contents: 

means for performing soft decision in code theory by assigning weights to said digital watermark sequence 
with a weighting function; and 

means for reconstituting and generating digital watermark data from said digital watermark sequence. 

The integrated circuit as claimed in claim 67, wherein said weighting function is a distribution function obtained by 
means comprising : 

means for dividing first digital data contents into one or a plurality of first block data; 

means for dividing second digital data contents into one or a plurality of second block data, said second digital 
data contents being obtained by manipulating said first digital data contents with a predetermined manipulation 
method: 

means for transforming said first block data and said second block data into first frequency coefficients and 
second frequency coefficients respectively by applying an orthogonal transform; and 

means for obtaining a distribution of difference values between said first frequency coefficients and said second 

frequency coefficients, said distribution function being an approximation of said distribution. 

wherein said weights are assigned to said digital watermark sequence according to values of said distribution 

function. 

The integrated circuit as claimed in claim 67, wherein said weighting function is a distribution function obtained by 
means comprising: 

means for dividing first digital data contents into one or a plurality of first block data; 

means for dividing second digital data contents into one or a plurality of second block data, said second digital 
data contents being obtained by manipulating said first digital data contents with a predetermined manipulation 
method: 

means for transforming said first block data and said second block data into first frequency coefficients and 
second frequency coefficients respectively by applying an orthogonal transform; and 

means for obtaining said distribution function on the basis of a theory if a distribution of difference values 
between said first frequency coefficients and said second frequency coefficients can be obtained by said theory, 
wherein said weights are assigned to said digital watermark sequence according to values of said distribution 
function. 

A computer readable medium storing program code for causing a computer system to read digital watermark data 
from digital data contents in which each bit of digital watermark data is embedded a plurality of times, characterized 
in that said computer readable medium comprises: 

program code means for receiving digital data contents; 

program code means for reading a digital watermark sequence from said digital data contents; 

program code means for performing soft decision in code theory by assigning weights to said digital watermark 

sequence with a weighting function; and 

program code means for reconstituting and generating digital watermark data from said digital watermark 
sequence. 

The computer readable medium as claimed in claim 70, wherein said weighting function is a distribution function 
obtained by program code means comprising: 

program code means for dividing first digital data contents into one or a plurality of first block data; 
program code means for dividing second digital data contents into one or a plurality of second block data, said 
second digital data contents being obtained by manipulating said first digital data contents with a predetermined 
manipulation method; 

program code means for transforming said first block data and said second block data into first frequency 
coefficients and second frequency coefficients respectively by applying an orthogonal transform; and 
program code means for obtaining a distribution of difference values between said first frequency coefficients 
and said second frequency coefficients, said distribution function being an approximation of said distribution. 
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wherein said weights are assigned to said digital watermark sequence according to values of said distribution 
function. 

72. The computer readable medium as claimed in claim 70. wherein said weighting function is a distribution function 
5 obtained by program code means comprising: 

program code means for dividing first digital data contents into one or a plurality of first block data; 
program code means for dividing second digital data contents into one or a plurality of second block data, said 
second digital data contents being obtained by manipulating said first digital data contents with a predetermined 

10 manipulation method; 

program code means for transforming said first block data and said second block data into first frequency 
coefficients and second frequency coefficients respectively by applying an orthogonal transform; and 
program code means for obtaining said distribution function on the basis of a theory if a distribution of difference 
values between said first frequency coefficients and said second frequency coefficients can be obtained by 

15 said theory, 

wherein said weights are assigned to said digital watermark sequence according to values of said distribution 
function. 
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